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ASSET SURVEILLANCE SYSTEM: 
APPARATUS AND METHOD 

CROSS-REFERENCE TO RELATED 

APPLICATIONS 5 

This application is a continuation-in-part patent applica- 
tion of U.S. application Ser. No. 10/095,835, filed Mar. 8, 
2002, now U.S. Pat. No. 6,892,163. 
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STATEMENT REGARDING FEDERALLY 
SPONSORED RESEARCH AND 
DEVELOPMENT 

The invention described herein was made in the perfor- 15 
mance of work under NASA Small Business Technology 
Transfer Research (STTR) Contract NAS8-98027, NASA 
Small Business Innovation Research (SBIR) Contract 
NAS4-99012, and NASA Small Business Technology 
Transfer Research (STTR) Contract NAS 13-01 001 and is 20 
subject to the provisions of Public Law 96-517 (35 USC 
202) and the Code of Federal Regulations 48 CFR 52.227-1 1 
as modified by 48 CFR 1 852.227- 1 1 , in which the contractor 
has elected to retain title. 
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FIELD OF THE INVENTION 

This invention relates generally to an asset surveillance 
system: apparatus and method and, in particular, to an asset 
surveillance system: apparatus and method for performing 30 
surveillance of an asset such as a process and/or apparatus 
wherein the surveillance is performed using a adaptive 
sequential probability (ASP) fault detection test for provid- 
ing an indication of an asset fault condition which may be 
followed by fault cause determination and indication. 35 

BACKGROUND OF THE INVENTION 

Conventional process surveillance schemes are sensitive 
only to gross changes in the mean value of a process signal 40 
or to large steps or spikes that exceed some threshold limit 
value. These conventional methods suffer from either a large 
number of false alarms (if thresholds are set too close to 
normal operating levels) or from a large number of missed 
(or delayed) alarms (if the thresholds are set too expan- 45 
sively). Moreover, most conventional methods cannot per- 
ceive the onset of a process disturbance or sensor signal 
error that gives rise to a signal below the threshold level or 
an alarm condition. Most conventional methods also do not 
account for the relationship between measurements made by 50 
one sensor relative to another redundant sensor or between 
measurements made by one sensor relative to predicted 
values for the sensor. 

Recently, improved methods for process surveillance 
have developed from the application of certain aspects of 55 
artificial intelligence technology. Specifically, parameter 
estimation methods have been developed using either sta- 
tistical, mathematical or neural network techniques to learn 
a model of the normal patterns present in a system of process 
signals. After learning these patterns, the learned model is 60 
used as a parameter estimator to create one or more pre- 
dicted (virtual) signals given a new observation of the actual 
process signals. Further, high sensitivity surveillance meth- 
ods have been developed for detecting process and signal 
faults by analysis of a mathematical comparison between an 65 
actual process signal and its virtual signal counterpart. In 
particular, such a mathematical comparison is most often 


2 

performed on a residual error signal computed as, for 
example, the difference between an actual process signal and 
its virtual signal counterpart. 

Parameter estimation based surveillance schemes have 
been shown to provide improved surveillance relative to 
conventional schemes for a wide variety of assets including 
industrial, utility, business, medical, transportation, finan- 
cial, and biological systems. However, parameter estimation 
based surveillance schemes have in general shown limited 
success when applied to complex processes. Applicants 
recognize and believe that this is because the parameter 
estimation model for a complex process will, in general, 
produce residual error signals having a non-Gaussian prob- 
ability density function. Moreover, a review of the known 
prior-art discloses that virtually all such surveillance sys- 
tems developed to date utilize or assume a Gaussian model 
of the residual error signal probability density function for 
fault detection. Hence, a significant shortcoming of the 
known prior-art is that, inter alia, parameter estimation 
based surveillance schemes will produce numerous false 
alarms due to the modeling error introduced by the assump- 
tion of a Gaussian residual error signal probability density 
function. The implication for parameter estimation based 
surveillance schemes is that the fault detection sensitivity 
must be significantly reduced to prevent false alarms thereby 
limiting the utility of the method for process surveillance. 
An alternative for statistically derived fault detection models 
is to mathematically pre-process the residual error signals to 
remove non-Gaussian elements prior to using the residual 
error signals in the fault detection model; however this 
approach requires an excess of additional processing and 
also limits the sensitivity of the surveillance method. There- 
fore, the implication of assuming a Gaussian residual error 
signal probability density function for a parameter estima- 
tion based surveillance scheme is simply that the system 
becomes less accurate thereby degrading the sensitivity and 
utility of the surveillance method. 

Many attempts to apply statistical fault detection tech- 
niques to surveillance of assets such as industrial, utility, 
business, medical, transportation, financial, and biological 
processes have met with poor results in part because the fault 
detection models used or assumed a Gaussian residual error 
signal probability density function. 

In one specific example, a multivariate state estimation 
technique based surveillance system for the Space Shuttle 
Main Engine’s telemetry data was found to produce numer- 
ous false alarms when a Gaussian residual error fault detec- 
tion model was used for surveillance. In this case, the 
surveillance system’s fault detection threshold parameters 
were desensitized to reduce the false alarm rate; however, 
the missed alarm rate then became too high for practical use 
in the telemetry data monitoring application. 

Moreover, current fault detection techniques for surveil- 
lance of assets such as industrial, utility, business, medical, 
transportation, financial, and biological processes either fail 
to recognize the surveillance performance limitations that 
occur when a Gaussian residual error model is used or, 
recognizing such limitations, attempt to artificially conform 
the observed residual error data to fit the Gaussian model. 
This may be attributed, in part, to the relative immaturity of 
the field of artificial intelligence and computer-assisted 
surveillance with regard to real-world process control appli- 
cations. Additionally, a general failure to recognize the 
specific limitations that a Gaussian residual error model 
imposes on fault decision accuracy for computer-assisted 
surveillance is punctuated by an apparent lack of known 
prior art teachings that address potential methods to over- 
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come this limitation. In general, the known prior-art teaches 
computer-assisted surveillance solutions that either ignore 
the limitations of the Gaussian model for reasons of math- 
ematical convenience or attempt to conform the actual 
residual error data to the artificial Gaussian model, for 
example, by using frequency domain filtering and signal 
whitening techniques. 

For the foregoing reasons, there is a need for a surveil- 
lance system and method that overcomes the significant 
shortcomings of the known prior-art as delineated herein- 
above. 

BRIEF SUMMARY OF THE INVENTION 

In general, and in one aspect, an embodiment of the 
invention provides a method for performing surveillance of 
an asset which comprises the steps of: obtaining residual 
data correlative to asset operation, fitting a mathematical 
model to the acquired residual data, storing the mathemati- 
cal model in a memory means, obtaining a current set of 
observed signal data from the asset; using the previously 
stored mathematical model in a sequential hypothesis test 
for determining if the current set of observed signal data is 
indicative of an asset fault condition; determining an asset 
fault cause correlative to a determined indication of an asset 
fault condition; and providing an indication correlative to a 
determined asset fault cause for providing asset surveillance. 

Moreover, Embodiments of the invention may include 
one or more of the following steps: determining a most 
probable asset fault cause from a plurality of possible asset 
fault causes; providing a response to a provided indication 
correlative to a determined asset fault cause; providing a 
control action as the provided response to a provided indi- 
cation correlative to a determined asset fault cause; and 
providing an alarm action as the provided response to a 
provided indication correlative to a determined asset fault 
cause. Embodiments of the invention may also include the 
steps of obtaining residual data correlative to at least two 
modes of asset operation; dividing the obtained residual data 
into a plurality of residual data subsets each having at least 
one mode of asset operation associated thereto; and fitting at 
least one mathematical model to at least one of the plurality 
of residual data subsets for defining at least the one math- 
ematical model stored in memory. 

In another aspect, an embodiment of the invention pro- 
vides a method for performing surveillance of an asset is 
provided comprised of the steps including: obtaining 
residual data correlative to at least two modes of asset 
operation; dividing the obtained residual data into a plurality 
of residual data subsets each having at least one mode of 
asset operation associated thereto; fitting at least one math- 
ematical model to at least one of the plurality of residual data 
subsets for defining at least one mathematical model stored 
in a memory; obtaining a current set of observed signal data 
from the asset; determining a mode of asset operation 
correlative to the current set of observed signal data obtained 
from the asset; retrieving at least one mathematical model 
from the memory means as a function of the determined 
mode of asset operation for obtaining at least one retrieved 
mathematical model, and using at least the one retrieved 
mathematical model in a sequential hypothesis test for 
determining if the current set of observed signal data is 
indicative of an asset fault condition. 

Moreover, and in combination with the above method, 
embodiments of the invention may further include one or 
more of the following steps: determining a most probable 
asset fault cause from a plurality of possible asset fault 


4 

causes; providing an indication correlative to a determined 
most probable asset fault cause; and providing a response to 
a provided indication correlative to a determined most 
probable asset fault cause wherein the provided response is, 
5 for example, a control action and/or an alarm action. 

In another aspect, an embodiment of the invention pro- 
vides a system for performing surveillance of an asset, the 
system comprising in combination: means for obtaining a 
current set of observed signal data from the asset; means for 
10 performing a sequential hypothesis test for determining if 
the current set of observed signal data is indicative of an 
asset fault condition; means for determining an asset fault 
cause correlative to a determined indication of an asset fault 
condition; and means for providing an indication correlative 
15 to a determined asset fault cause for providing asset sur- 
veillance. 

Moreover, and in combination with the above system, 
embodiments of the invention may further include one or 
2Q more of the following features: means for obtaining residual 
data correlative to at least two modes of asset operation; 
means for dividing the acquired residual data into a plurality 
of residual data subsets each having at least one mode of 
asset operation associated thereto; means for fitting at least 
25 one mathematical model to at least one of the plurality of 
residual data subsets and storing at least the one mathemati- 
cal model in a memory means; means for determining a 
mode of asset operation correlative to the current set of 
observed signal data obtained from the asset, means for 
30 retrieving at least one mathematical model from said 
memory means as a function of the determined mode of 
asset operation for obtaining at least one retrieved math- 
ematical model; and means for using at least the one 
retrieved mathematical model in performing the sequential 
35 hypothesis test for determining if the current set of observed 
signal data is indicative of an asset fault condition. Embodi- 
ments of the invention may also include one or more of the 
following features: means for determining a most probable 
asset fault cause correlative to a determined indication of an 
40 asset fault condition from a plurality of possible asset fault 
causes; means for providing an indication correlative to a 
determined most probable asset fault cause; means for 
providing a response to an indication correlative to a deter- 
mined most probable asset fault cause wherein the means for 
45 providing a response may includes means for providing a 
control and/or an alarm action to an indication correlative to 
a determined most probable asset fault cause. 

In another aspect, an embodiment of the invention pro- 
vides a method for performing surveillance of an asset 
50 which comprises the steps of obtaining a current set of 
observed signal data from the asset; performing a sequential 
hypothesis test for determining if the current set of observed 
signal data is indicative of an asset fault condition; deter- 
mining an asset fault cause correlative to a determined 
55 indication of an asset fault condition; and providing an 
indication correlative to a determined asset fault cause for 
providing asset surveillance. 

In another aspect, an embodiment of the invention pro- 
vides a computer readable medium having a program 
60 recorded thereon in which the program causes, in use, a 
computer running the program to execute a procedure for 
performing surveillance of an asset comprised of the steps 
including: obtaining a current set of observed signal data 
from the asset; performing a sequential hypothesis test for 
65 determining if the current set of observed signal data is 
indicative of an asset fault condition; determining an asset 
fault cause correlative to a determined indication of an asset 
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fault condition; and providing an indication correlative to a 
determined asset fault cause for providing asset surveillance. 

Accordingly, it should be apparent that numerous modi- 
fications and adaptations may be resorted to without depart- 
ing from the scope and fair meaning of the claims as set forth 5 
hereinbelow. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. lisa block diagram of an asset surveillance system, to 

FIG. 2 is a schematic functional flow diagram of an 
embodiment of an asset surveillance system. 

FIG. 3 is a schematic functional flow diagram of a method 
and system for training a process model consisting of at least 
one parameter estimation model and at least one fault 15 
detection model using recorded observations of the actual 
process signals. 

FIG. 4 is a schematic functional flow diagram of a method 
and system for the fault detection model training procedure. 

FIG. 5 is a schematic functional flow diagram of a method 20 
and system for performing surveillance of an asset using at 
least one parameter estimation model and at least one fault 
detection model. 

FIG. 6 is a schematic functional flow diagram of a method 
and system for the fault detection surveillance procedure. 25 

FIG. 7 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 
detection using a redundant sensor. 

FIG. 8 is a schematic functional flow diagram of a method 
and system for performing parameter estimation and fault 30 
detection using a generalized parameter estimation model, 
such as a multivariate state estimation model, a neural 
network model, an analytical model, or a Kalman filter 
model . 

FIG. 9 is a flow diagram of a training and surveillance 35 
procedure. 

FIG. 10 illustrates characteristics of the null and alternate 
hypotheses for the prior-art sequential probability ratio test 
(SPRT) mean tests. 

FIG. 11 illustrates characteristics of the null and alternate 
hypotheses for the prior-art SPRT variance tests. 

FIG. 12 illustrates acquired operating data, estimated 
parameter data, and residual error data for a typical Space 
Shuttle Main Engine accelerometer. 45 

FIG. 13 illustrates a probability density function of the 
residual error data for a typical Space Shuttle Main Engine 
accelerometer with comparison to a Gaussian probability 
density function. 

FIG. 14 illustrates an un-optimized one-term expansion 50 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 15 illustrates an un-optimized two-term expansion 55 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 16 illustrates an un-optimized three-term expansion 60 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 17 lists the root mean square errors for five different 65 
un-optimized probability density function models computed 
for each of six Space Shuttle Main Engine accelerometers. 
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FIG. 18 lists the root mean square errors for five different 
optimized probability density function models computed for 
each of six Space Shuttle Main Engine accelerometers. 

FIG. 19 illustrates the optimized two-term expansion 
probability density function model of the residual error data 
for a typical Space Shuttle Main Engine accelerometer with 
comparison to the actual residual error data and a Gaussian 
probability density function. 

FIG. 20 illustrates the empirical false alarm rates with 
comparison to the prior art SPRT method. 

DETAILED DESCRIPTION OF THE 
INVENTION 

Considering the drawings, wherein like reference numer- 
als denote like parts throughout the various drawing figures, 
reference numeral 10 is directed to an asset surveillance 
system: apparatus and method. 

Referring to the drawings, an embodiment of the inven- 
tion provides an asset surveillance system 10 for performing 
high sensitivity surveillance of a wide variety of assets 
which may include one or more industrial, utility, business, 
medical, transportation, financial, and biological processes 
and apparatuses wherein such system, process, apparatus 
and/or data signal source asset preferably has at least one 
distinct measured or observed signal or sequence comprised 
of characteristic data values which are processed by the 
system 10 for providing ultra- sensitive detection of the onset 
of, for example, system, process, apparatus and/or data 
signal source operating anomalies which may include sensor 
degradation, data signal degradation, and/or component per- 
formance degradation. The system 10 includes a training 
procedure 30 carried out on a computer 22 such that a 
process model 50 of an asset 12 (e.g., system, process, 
apparatus and/or data signal source) is stored in an associ- 
ated memory 24 after being learned from historical operat- 
ing data as delineated hereinbelow. The historical operating 
data includes a set or range of observations from expected or 
typical operation of the asset 12 that are acquired and 
digitized by a data acquisition means 20 and stored in a 
memory means 24 as training data 34 by using any combi- 
nation of electronic data acquisition hardware and signal 
processing software 20 known to those having ordinary skill 
in the art, and informed by the present disclosure. Addition- 
ally, historical operating data may include a set or range of 
observations from atypical operations of the asset 12 that are 
acquired and digitized by the data acquisition means 20 and 
stored in a memory 24 and included in training data 34 by 
using any combination of electronic data acquisition hard- 
ware and signal processing software 20 known to those 
having ordinary skill in the art, and informed by the present 
disclosure. 

In one embodiment, process model 50 is comprised of a 
parameter estimation model 52 or a collection of parameter 
estimation models or submodels 52 as necessary to provide 
an estimated value for each monitored item such as each 
sensor or data signal 14 of asset 12 to be monitored for the 
presence of fault conditions during a surveillance procedure 
60. Additionally, and in one embodiment, the process model 
50 is comprised of a fault detection model 54 or a collection 
of fault detection models or submodels 54 such that, for 
example, at least one fault detection model 54 is provided 
for each monitored item such as each sensor or data signal 
14 of asset 12 to be monitored for the presence of fault 
conditions during the surveillance procedure 60. Further- 
more, and in one embodiment, the process model 50 is 
comprised of a fault cause or classification model 55 which 
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may be, in general, created by using any method suitable for 
defining a fault cause or classification model, models, or 
submodels useful for determining the presence, source, or 
cause of an unacceptable asset status or condition on the 
basis of one or more fault indications. Methods suitable for 
the fault cause or classification model 55 may include, but 
are not limited to, a plurality of Bayesian belief network 
techniques, a plurality of neural network techniques, a 
plurality of decision tree techniques, a plurality of expert 
system techniques, a plurality of rule-based techniques, a 
plurality of determined similarity techniques, a plurality of 
hypothesis test techniques, and a plurality of procedural 
logic techniques. Each fault cause or classification model or 
submodel contained in the process model 50 may be created 
to implement any of a plurality of fault cause or classifica- 
tion techniques. Furthermore, the fault cause or classifica- 
tion technique implemented for an individual model or 
submodel is not constrained to be the same as the fault cause 
or classification technique implemented for any other model 
or submodel contained in the process model 50. 

The surveillance procedure 60, in one embodiment, uti- 
lizes the stored process model 50 in a fault detection 
procedure 66 for determining if a current set of observed 
signal data is indicative of an asset fault condition. The 
surveillance procedure 60 further utilizes the stored process 
model 50 for determining an asset fault cause correlative to 
a determined indication of an asset fault condition. The 
surveillance procedure 60 also provides an indication cor- 
relative to a determined asset fault cause for providing asset 
surveillance. Moreover, an embodiment of the invention 
may include an operating mode determinator procedure 31 
which provides operating modes for partitioning of the 
process model 50 which enables different parameter estima- 
tion models or methods, fault detection models or methods, 
and fault cause or classification models or methods to be 
used for surveillance within each individual operating mode 
of an asset. This ability enables lower false alarm rates and 
missed alarm rates. 

In practice, and in one embodiment, the training proce- 
dure 20 may employ the operating mode determinator 
procedure 31 to partition the historical operating data into 
one or more training data subsets that together comprise a 
training data set 34 wherein each training data subset may be 
representative of a single operating mode i (M z ) wherein M z - 
is any mode between Mode 1 (M : ) to Mode N (M^) where 
N is a positive integer or each training data subset 28 is 
representative of a subset of operating modes of the asset 12. 
In practice, a designer may first select operating modes that 
will be included in the decision model 50 by means of 
operating mode determinator procedure 31. The method 
thereafter may be comprised of a training loop wherein each 
possible operating mode of the asset 12 is assessed for 
inclusion in the decision model 50. Thus, an embodiment of 
the invention creates and uses, for the purpose of asset 
surveillance, a coordinated collection of decision models or 
submodels wherein each decision model or submodel in the 
coordinated collection is optimized for a single operating 
mode or subset of operating modes of an asset. 

For example, a rocket engine typically has different 
modes of operation typically comprised of different distri- 
bution of residuals expected from the rocket engine during, 
for example, startup, a percentage of steady power such as 
50% steady power, 100% steady power, and shutdown 
operations. 

Still referring to FIG. 2, the operating mode determinator 
procedure 31 used to classify each observation included in 
the training data set 34 may be, in general, performed using 
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any method suitable for determining the operating mode of 
the asset 12 given an observation or series of observations 
therefrom. Methods suitable for operating mode determina- 
tor procedure 31 include, but are not limited to, a plurality 
5 of mathematical or logic sequence techniques, a plurality of 
expert system techniques, a plurality of fuzzy logic tech- 
niques, a plurality of determined similarity techniques, a 
plurality of clustering techniques, and a plurality of neural 
network techniques. 

to Hence, and in contrast to the known prior art, and in one 
embodiment of the invention, parameter estimation meth- 
ods, fault detection methods, and fault cause or classification 
methods may be individually tailored for each operating 
mode of the asset thereby providing additional capability to 
15 reduce decision error rates for the surveillance system. 

Accordingly, and in one aspect, an embodiment of the 
invention provides a method for performing surveillance of 
an asset which comprises the steps of obtaining a current set 
of observed signal data from the asset; performing a sequen- 
20 tial hypothesis test for determining if the current set of 
observed signal data is indicative of an asset fault condition; 
determining an asset fault cause correlative to a determined 
indication of an asset fault condition; and providing an 
indication correlative to a determined asset fault cause for 
25 providing asset surveillance. 

In another aspect, an embodiment of the invention pro- 
vides a method for performing surveillance of an asset 
which comprises the steps of: 1) obtaining residual data 
correlative to asset operation; 2) fitting a mathematical 
30 model to the acquired residual data; 3) storing the math- 
ematical model in a memory means; 3) obtaining a current 
set of observed signal data from the asset; 4) using the 
previously stored mathematical model in a sequential 
hypothesis test for determining if the current set of observed 
35 signal data is indicative of an asset fault condition; 5) 
determining an asset fault cause correlative to a determined 
indication of an asset fault condition; and 6) providing an 
indication correlative to a determined asset fault cause for 
providing asset surveillance. 

40 Additionally, embodiments of the invention may include 
one or more of the following steps in combination with the 
above six steps in the above method: determining a most 
probable asset fault cause from a plurality of possible asset 
fault causes; providing a response to a provided indication 
45 correlative to a determined asset fault cause; providing a 
control action as the provided response to a provided indi- 
cation correlative to a determined asset fault cause; and 
providing an alarm action as the provided response to a 
provided indication correlative to a determined asset fault 
50 cause. 

Furthermore, embodiments of the invention may include 
one or more of the following steps in combination with the 
above six steps in the above method: obtaining residual data 
correlative to at least two modes of asset operation; dividing 
55 the obtained residual data into a plurality of residual data 
subsets each having at least one mode of asset operation 
associated thereto; and fitting at least one mathematical 
model to at least one of the plurality of residual data subsets 
for defining at least the one mathematical model stored in the 
60 memory. 

In another aspect, an embodiment of the invention pro- 
vides a method for performing surveillance of an asset 
comprised of the steps including: 1) obtaining residual data 
correlative to at least two modes of asset operation; 2) 
65 dividing the obtained residual data into a plurality of 
residual data subsets each having at least one mode of asset 
operation associated thereto; 3) fitting at least one math- 
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ematical model to at least one of the plurality of residual data 
subsets for defining at least one mathematical model stored 
in a memory; 4) obtaining a current set of observed signal 
data from the asset; 5) determining a mode of asset operation 
correlative to the current set of observed signal data obtained 
from the asset with mode determinator 31; 6) retrieving at 
least one mathematical model from the memory means as a 
function of the determined mode of asset operation for 
obtaining at least one retrieved mathematical model, and 7) 
using at least the one retrieved mathematical model in a 
sequential hypothesis test for determining if the current set 
of observed signal data is indicative of an asset fault 
condition. 

Additionally, embodiments of the invention may include 
one or more of the following steps in combination with the 
above seven steps in the above method: determining a most 
probable asset fault cause from a plurality of possible asset 
fault causes; providing an indication correlative to a deter- 
mined most probable asset fault cause; and providing a 
response to a provided indication correlative to a determined 
most probable asset fault cause wherein the provided 
response is, for example, a control action and/or an alarm 
action. 

In another aspect, an embodiment of the invention pro- 
vides a computer readable medium having a program 
recorded thereon in which the program causes, in use, a 
computer running the program to execute a procedure for 
performing surveillance of an asset comprised of the steps 
including: obtaining a current set of observed signal data 
from the asset; performing a sequential hypothesis test for 
determining if the current set of observed signal data is 
indicative of an asset fault condition; determining an asset 
fault cause correlative to a determined indication of an asset 
fault condition; and providing an indication correlative to a 
determined asset fault cause for providing asset surveillance. 
Any type of computer readable medium may be employed 
and examples include floppy disks, conventional hard disks, 
CD-ROMS, Flash ROMS, nonvolatile ROM, and RAM. 

In another aspect, an embodiment of the invention pro- 
vides a system for performing surveillance of an asset, the 
system comprising in combination: means for obtaining 
residual data correlative to at least two modes of asset 
operation; means for dividing the acquired residual data into 
a plurality of residual data subsets each having at least one 
mode of asset operation associated thereto; means for fitting 
at least one mathematical model to at least one of the 
plurality of residual data subsets and storing at least the one 
mathematical model in memory 24; means for determining 
a mode of asset operation correlative to the current set of 
observed signal data obtained from the asset, means for 
retrieving at least one mathematical model from said 
memory means as a function of the determined mode of 
asset operation for obtaining at least one retrieved math- 
ematical model; and means for using at least the one 
retrieved mathematical model in performing a sequential 
hypothesis test for determining if the current set of observed 
signal data is indicative of an asset fault condition; means for 
determining a most probable asset fault cause correlative to 
a determined indication of an asset fault condition from a 
plurality of possible asset fault causes; means for providing 
an indication correlative to a determined most probable asset 
fault cause; means for providing a response to an indication 
correlative to a determined most probable asset fault cause 
wherein the means for providing a response may includes 
means for providing a control and/or an alarm action to an 
indication correlative to a determined most probable asset 
fault cause. 
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In particular, and referring to FIGS. 1 through 3, an 
embodiment the invention provides a surveillance system 10 
for providing surveillance of an asset such as a system, a 
process, an apparatus and/or a data signal source 12 by 
5 providing training procedure 30 and surveillance procedure 
60. 

Referring to FIGS. 3 and 4, one embodiment of the 
invention provides the training procedure 30 comprised of a 
calibrate parameter estimator procedure 36, a calibrate fault 
to detector procedure 38, and a calibrate fault cause or classi- 
fication procedure 39. 

The calibrate parameter estimator procedure 36, in one 
embodiment, creates one or more parameter estimation 
models 52, which may include submodels, and trains each 
15 parameter estimation model/submodel by utilizing training 
data 34 correlative to expected and/or atypical asset opera- 
tion and, for example, utilizing a multivariate state estima- 
tion technique (MSET) procedure. The calibrate parameter 
estimator procedure 36 further stores the one or more 
20 parameter estimation models 52 as a new parameter estima- 
tion model in a process model 50. As noted, the process 
model 50 may contain one or more parameter estimation 
models/submodels depending upon design requirements. 
Additionally, each parameter estimation model contained in 
25 the process model may be created to implement any one of 
a plurality of parameter estimation techniques. 

The calibrate fault detector procedure 38, in one embodi- 
ment, makes use of the parameter estimation models 52 to 
provide estimated values for at least one signal parameter 
30 contained in the training data 34. Generally, the calibrate 
fault detector procedure will create a separate and distinct 
fault detection model for each sensor or data signal associ- 
ated with the asset being monitored for the presence of fault 
conditions during the surveillance procedure. 

35 In one embodiment, the calibrate fault detector procedure 
38 includes a method of fitting a mathematical model in the 
form of, for example, a standard Gaussian probability den- 
sity function (PDF) to a training data residual distribution 
(computed as a function of estimated process parameters and 
40 the training data) and then adding successive higher order 
terms of a remainder function to the standard Gaussian PDF 
for the purpose of defining a general PDF that better fits the 
computed training data residual distribution. Other tech- 
niques for fitting a general PDF to the training data are 
45 similarly feasible and useful and include, for example, a 
technique for fitting a polynomial function to the training 
data. 

Moreover, other techniques for fitting a general PDF to 
the training data are similarly feasible and useful and 
50 include, for example, a technique for fitting at least one 
mathematical function correlative to a frequency distribu- 
tion to the residual data or residual data subsets. The step or 
procedure of fitting at least one mathematical function 
correlative to a frequency distribution to the residual data or 
55 residual data subsets may include a step of fitting at least one 
function defining a frequency curve, a frequency histogram, 
and/or a frequency polygon to the residual data or residual 
data subsets. 

Hence, in one embodiment, the asset surveillance system 
60 10 includes a fault detection model 54 of unconstrained 
probability density function form and having a procedure 
suitable for overcoming a performance limiting trade-off 
between probability density function modeling complexity 
and decision accuracy that has been unrecognized by the 
65 known prior-art. Specifically, an embodiment of the inven- 
tion can employ any one of a plurality of residual error 
probability density function model forms, including but not 
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limited to a Gaussian form, thereby allowing a surveillance 
system to utilize the model form best suited for optimizing 
surveillance system performance. 

Moreover, an embodiment of the invention provides a 
surveillance system 10 that uses a computer-assisted learn- 
ing procedure to automatically derive the most suitable form 
of the residual error probability density function model by 
observation and analysis of a time sequence of process 
signal data and by a combination of a plurality of techniques. 
This ability enables surveillance to be performed by an 
embodiment of the invention with lower false alarm rates 
and lower missed alarm rates than can be achieved by the 
known prior-art systems and methods. 

The calibrate fault cause or classification procedure 39, in 
one embodiment, may be performed using any method 
suitable for defining a fault cause model useful for deter- 
mining the presence, source or cause of an unacceptable 
asset status or condition on the basis of one or more fault 
indications. Methods suitable for the fault cause model(s)/ 
submodel (s) creation procedure may include, but are not 
limited to, Bayesian belief network techniques, neural net- 
work techniques, decision tree techniques, expert system 
techniques, rule-based techniques, determined similarity 
techniques, hypothesis test techniques, and procedural logic 
techniques. Each fault classification model or submodel 
contained in the decision model may be created to imple- 
ment any of a plurality of fault cause or classification 
techniques. Further, the fault cause technique implemented 
for an individual model or submodel is not constrained to be 
the same as the fault model technique implemented for any 
other model or submodel contained in the decision model. 

In one embodiment, the training procedure 30 is com- 
pleted when all desired training data has been used to 
calibrate the process model 50 or when all desired expected 
operating modes of the asset 12 have been assessed. At this 
point, the process model 50 preferably includes one or more 
parameter estimation models and/or submodels 52, fault 
detection models and/or submodels 54, and fault cause or 
classification models and/or submodels 55 for each source 
such as each sensor and/or data signal associated with the 
asset 12 being monitored for the presence of fault conditions 
during the surveillance procedure 60. The process model 50 
is thereafter used for performing surveillance of an asset. 

Referring to FIGS. 1 through 6, and in one embodiment 
of the invention, the surveillance procedure 60 is performed 
with a fault detection procedure 66 using an adaptive 
sequential probability (ASP) fault detection test 78 com- 
prised of the general probability density function model 
empirically derived from the numerical analysis of the asset 
training data. 

In one embodiment, the surveillance procedure 60 
acquires and digitizes current asset data with procedure 62 
and then estimates process parameters as a function of the 
acquired digitized current asset data and the parameter 
estimation model(s) obtained from the calibrate parameter 
estimator procedure. Then, fault detection is determined by 
first computing data residuals, process 76, as a function of 
the estimated process parameters and the acquired digitized 
current asset data and then performing the ASP test(s) 78 as 
a function of the fault detection models and thus, as a 
function of the fitted general PDF obtained in the calibrate 
fault detector procedure. Each ASP test returns one of three 
possible states: a not null state which rejects the probability 
that a null hypothesis is true and excepts an alternative 
hypothesis correlative to unexpected operation of the asset; 
a null state which accepts the probability that a null hypoth- 
esis is true and excepts the null hypothesis correlative to 
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expected operation of the asset; and an in-between state 
which excepts neither the null hypothesis nor the alternative 
hypothesis as being true and requires more data to reach a 
conclusion. 

5 In one embodiment, the results of the fault detection 
procedure are then analyzed using an asset fault cause 
procedure 84 for determining an asset fault cause to a 
determined indication of an asset fault condition obtained 
when the analysis determines that the results indicate unex- 
10 pected operation of the asset. Next, and in one embodiment, 
an indication procedure 86 provides an indication correlative 
to a determined asset fault cause and a response procedure 
88 provides an action such as an alarm, control and/or a 
communication action for providing asset surveillance. 

15 Process Model Training Procedure: 

More specifically, and referring to FIGS. 1 and 3, the 
training procedure 30 of the system 10 includes a method 
and apparatus for training or preparing the process model 50 
using historical operating data from the asset 12 that has 
20 been acquired by the data acquisition means 20 using any 
combination of conventional electronic data acquisition 
hardware and signal processing software as is well known in 
the art. The historical operating data is acquired in digital 
format and stored in memory means 24 using a data storage 
25 procedure 32 to create a training data set 34. 

The training data set 34 includes at least N discrete 
observations of the asset 12 wherein each single observa- 
tion, herein denoted Xobs, is comprised of a vector of data 
values for each signal parameter to be included in the 
30 process model 50 . For the purposes of the training procedure 
30, the number of observations, N, acquired is at least great 
enough to adequately bound the operating state space of the 
asset 12. Thus, the training data set 34 provides a represen- 
tative sample of the signals produced by the asset 12 during 
35 at least one normal modes of operation. 

Upon acquiring the training data set 34, the unique 
training procedure 30 can be implemented. 

The unique training procedure 30 is comprised of a 
calibrate parameter estimator procedure 36 and a calibrate 
40 fault detector procedure 38. The calibrate parameter estima- 
tor procedure 36 creates the parameter estimation model 52 
and trains the parameter estimation model 52 using the 
training data 34. The calibrate parameter estimator proce- 
dure 36 further stores this model as a new parameter 
45 estimation model 52 in the process model 50. 

The process model 50 may contain one or more parameter 
estimation models 52 depending upon the requirements of 
the approach taken by a designer. Continuing to refer to FIG. 
3, the training procedure 30 may be, in general, performed 
50 using any parameter estimation method suitable for defining 
a parameter estimation model 52 useful for estimating the 
values of one or more process signals. Methods suitable for 
the calibrate parameter estimator procedure 36 include, but 
are not limited to, a plurality of redundant sensor techniques, 
55 a plurality of multivariate state estimation techniques, a 
plurality of kernel regression techniques, a plurality of 
neural network techniques, a plurality of mathematical 
model techniques, a plurality of autoregressive moving 
average techniques, and a plurality of Kalman filter tech- 
60 niques. Each parameter estimation model 52 contained in 
the process model 50 may be created to implement any one 
of a plurality of parameter estimation techniques. Further, 
the parameter estimation technique implemented for an 
individual parameter estimation model 52 is not constrained 
65 to be the same as the parameter estimation technique imple- 
mented for any other parameter estimation model 52 con- 
tained in the process model 50. 
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One example of the calibrate parameter estimator proce- 
dure 36 would be the computation of a bias term between 
two redundant sensors wherein the parameter estimation 
model 52 used for estimating the value of one sensor during 
the surveillance procedure 60 consisted of summing the 
observed value of a second redundant sensor with a bias 
term computed during the training procedure 30 as the mean 
difference between the two sensor values over the training 
data set 34. More sophisticated examples of the training 
procedure 30 using multivariate state estimation techniques 
will be described herein below. 

Still referring to FIG. 3, the calibrate fault detector 
procedure 38 makes use of the parameter estimation models 
52 to provide estimated values for at least one signal 
parameter contained in the training data. Generally, the 
calibrate fault detector procedure 38 will create a separate 
and distinct fault detection model 54 for each sensor or data 
signal of asset 12 to be monitored for the presence of fault 
conditions during the surveillance procedure 60. As delin- 
eated hereinabove, one embodiment of the invention pro- 
vides the fault detection model 54 as an element of the 
process model 50 for the asset 12 that is derived during the 
training procedure 30. In particular, one embodiment of the 
invention encompasses a statistical hypothesis test type of 
fault detection model 54 having novel and unique charac- 
teristics and calibration procedures described herein includ- 
ing but not limited to having a probability density function 
model empirically derived from a numerical analysis of 
asset operating data. 

Continuing to refer to FIG. 3, the calibrate fault cause or 
classification procedure 39, in one embodiment, may be 
performed using any method suitable for defining a fault 
cause model useful for determining the presence, source or 
cause of an unacceptable asset status or condition on the 
basis of one or more fault indications. Methods suitable for 
the fault cause model(s)/submodel(s) creation procedure 
may include, but are not limited to, Bayesian belief network 
techniques, neural network techniques, decision tree tech- 
niques, expert system techniques, rule-based techniques, 
determined similarity techniques, hypothesis test tech- 
niques, and procedural logic techniques. Each fault classi- 
fication model or submodel contained in the decision model 
may be created to implement any of a plurality of fault cause 
or classification techniques. Further, the fault cause tech- 
nique implemented for an individual model or submodel is 
not constrained to be the same as the fault model technique 
implemented for any other model or submodel contained in 
the decision model. 

Continuing to refer to FIG. 3, and in one embodiment, the 
training procedure 30 is completed when all desired training 
data has been used to calibrate the process model 50 or when 
all desired expected operating modes of the asset 12 have 
been assessed. At this point, the process model 50 preferably 
includes one or more parameter estimation models and/or 
submodels 52, fault detection models and/or submodels 54, 
and fault cause or classification models and/or submodels 55 
for each source such as each sensor and/or data signal 
associated with the asset 12 being monitored for the pres- 
ence of fault conditions during the surveillance procedure 
60. The process model 50 is thereafter used for performing 
surveillance of an asset. 

Referring to FIG. 4, the training procedure 30 is illus- 
trated in additional detail. A designer 16 initializes the 
calibrate parameter estimators procedure 36 by specifying a 
set of parameter estimator methods and settings 40. The 
parameter estimator methods and settings 40 are then used 
to operate on the training data 34 via a nominal signal 
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behavior modeling procedure 42, for example using an 
MSET training procedure as described herein in detail, in 
order to create the parameter estimation models 52, which 
are stored in the process model 50. 

5 Still referring to FIG. 4, the training procedure 30 next 
proceeds to the calibrate fault detectors procedure 38 
wherein the parameter estimation models 52 are an input to 
the procedure. The designer 16 initializes the calibrate fault 
detectors procedure 38 by specifying a set of fault detector 
to methods and settings 46. Next, the estimate process param- 
eters procedure 64 operates the parameter estimation models 
52 over the training data 34 to generate an estimated value 
for each monitored signal value contained in the training 
data 34. It is important that the estimate process parameters 
15 procedure 64 used in the calibrate fault detectors procedure 
38 be the same estimate process parameters procedure 64 
that will later be used in the surveillance procedure 60 
(reference FIGS. 1 and 2 for surveillance procedure 60). 
Next, a compute training data residuals procedure 44 cal- 
20 culates the training data residuals for each monitored signal, 
which are calculated as the difference between the training 
data value and the corresponding estimated data value for 
each monitored signal. The training data residuals are next 
used by a compute nominal residual probability density 
25 function (PDF) procedure 48 to create a fault detection 
model 54 for each monitored signal. In one embodiment of 
the invention, the fault detection models 54 is typically 
comprised of mathematical descriptions of the probability 
density function that best characterizes or best fits the 
30 training data residual for the monitored signal. The training 
data is presumed to accurately characterize the expected 
normal operating states of the asset 12. Therefore, the 
training data residuals are characteristic of the expected 
normal deviations between the observed signal values and 
35 the values estimated using the parameter estimation models 
52. The fault detection models 54 are stored in the process 
model 50 thereby completing the training procedure. 

In one embodiment, the system 10 computes the fault 
detection models 54 by the means of the compute nominal 
40 residual probability density function (PDF) procedure 48 . As 
will be described mathematically herein below, the compute 
nominal residual probability density function (PDF) proce- 
dure 48 fits a general open-ended probability function to the 
training data residuals and employs this fitted function when 
45 implementing a herein named Adaptive Sequential Probabil- 
ity (ASP) method and system for computing the fault 
detection model 54 and thereafter employing said fault 
detection model 54 for the purpose of performing a fault 
detection procedure 66 of the surveillance procedure 60. 

50 Surveillance Procedure: 

Referring to FIG. 5, the surveillance procedure 60 is 
comprised of acquiring successive vectors of current oper- 
ating data and determining for each such observation vector 
whether the current operating data is indicative of a fault or 
55 failure of the asset 12. The surveillance procedure 60 further 
includes implementing an alarm or control action 70 for the 
purpose of notifying an operator and/or taking a corrective 
action in response to a detected fault or failure of the asset 
12. The surveillance procedure 60 is in general an open- 
60 ended data acquisition and analysis loop that continues until 
such time as the operator chooses to terminate the surveil- 
lance 74. 

More specifically, and referring to FIG. 5, the surveillance 
procedure 60 begins 58 with an acquire current operating 
65 data procedure 62 that employs the data acquisition and 
digitization means 20 (FIG. 1) to acquire a current set of 
signal data from the monitored asset 12. The current set of 
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signal data is provided to the estimate process parameters 
procedure 64 that uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values. 

The observed and estimated data are next provided to a 
perform fault detection procedure 66 that uses one or more 
of the fault detection models and/or submodels 54 to per- 
form a fault found procedure for determining whether a fault 
is found in the current operating data. If a fault is found is 
true, the fault cause procedure 84 determines a cause to the 
determined asset fault condition obtained when the analysis 
determines that the results indicate unexpected operation of 
the asset. Next, and in one embodiment, an indication 
procedure 86 provides an indication correlative to a deter- 
mined asset fault cause and a response procedure 88 pro- 
vides an action such as an alarm, control, and/or a commu- 
nication action for providing asset surveillance. For 
example, if a fault is found is true the cause can be 
determined and indicated for providing an alarm and/or 
control action is taken by for example, an operator console 
80 and/or an automated process control system or means 82. 

Upon completing the fault found procedure 68, the sur- 
veillance procedure 60 then repeats for the next available set 
of signal data for as long as a surveillance complete decision 
procedure 72 determines that additional surveillance data are 
available or terminates at surveillance complete step 74 
when no more surveillance data are available or when 
terminated by an operator or other means. 

Referring now to FIG. 6, the perform fault detection 
procedure 66 of surveillance procedure 60 is illustrated in 
additional detail. For each current set of signal data values 
acquired the estimate process parameters procedure 64 uses 
the parameter estimation models 52 to estimate values for 
one or more of the current signal data values. The compute 
data residuals procedure 76 performs a mathematical trans- 
formation on the acquired and estimated values to produce 
a current set of residual data values. Said mathematical 
transformation is most typically a simple mathematical 
difference, however, any appropriate transformation may be 
used including transformations that smooth and/or remove 
correlated and uncorrelated noise from the residual data 
values. The residuals produced and transformed in the 
compute data residuals procedure 76 are next processed by 
a perform ASP tests procedure 78 that uses the fault detec- 
tion models 54 to produce a ASP fault indication. Next, the 
fault found decision procedure 68 is performed on the basis 
of the ASP fault indication results produced by the perform 
ASP tests procedure 78. The fault found decision procedure 
68 may have any one of a plurality of structures and 
procedures, including but not limited to methods and sys- 
tems to perform false alarm filtering by means of observing 
a time series of ASP fault indication results for the purposes 
of determining the actual presence of a fault. In one embodi- 
ment of the invention, a conditional probability fault found 
decision procedure 68 is used to perform said false alarm 
filtering. 

Continuing to refer to FIG. 6, the estimate process param- 
eters procedure 64 uses the parameter estimation models 52 
to estimate values for one or more of the current signal data 
values wherein the parameter estimation method used may 
have any one of a plurality of structures and procedures, 
including but not limited to, a plurality of redundant sensor 
techniques, a plurality of multivariate state estimation tech- 
niques, a plurality of kernel regression techniques, a plural- 
ity of neural network techniques, a plurality of mathematical 
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model techniques, a plurality of autoregressive moving 
average techniques, and a plurality of Kalman filter tech- 
niques. 

Referring to FIG. 7, one possible redundant sensor tech- 
5 nique for the estimate process parameters procedure 64 is 
illustrated. The acquire current operating data procedure 62 
is used to acquire current signal data values from signals 14 
monitored from asset 12 via sensors 18. The estimated value 
for a first redundant sensor signal is computed using a 
to mathematical transformation on the acquired value of a 
second redundant sensor signal. Said mathematical transfor- 
mation is the estimate process parameters procedure 64 that 
in this case may be a simple equivalence or may include 
biasing, de-noising or other signal processing. The compute 
15 data residuals procedure 76 is then performed followed by 
the perform ASP tests procedure 78 as described herein- 
above and further delineated hereinbelow. 

Referring to FIG. 8, one possible multivariable parameter 
estimation technique for the estimate process parameters 
20 procedure 64 is illustrated. The acquire current operating 
data procedure 62 is used to acquire current signal data 
values from signals 14 monitored from asset 12 via sensors 
18. The estimated value for one or more sensor signals is 
computed using a mathematical transformation on the 
25 acquired values of one or more sensor signals. Said math- 
ematical transformation is the estimate process parameters 
procedure 64 that in this case may implement any feasible 
parameter estimation technique or procedure, including but 
not limited to a plurality of multivariate state estimation 
30 techniques, a plurality of kernel regression techniques, a 
plurality of neural network techniques, a plurality of math- 
ematical model techniques, and a plurality of Kalman filter 
techniques. The compute data residuals procedure 76 is then 
performed followed by the perform ASP tests procedure 78 
35 as described hereinabove and further delineated hereinbe- 
low. 

Referring again to FIG. 6, one usefulness of an embodi- 
ment of the invention is, inter alia, the improvement 
achieved in the accuracy of the fault decision procedure 68 
40 that results from the improvement achieved in the accuracy 
of perform fault detection procedure 66 made possible by 
the novel perform ASP tests procedure 78. Improving the 
accuracy of the fault decision procedure 68 accomplishes a 
reduction in the number of false alarms sent to a process 
45 operator or control system that can in turn result in an 
erroneous alarm or control action by the alarm or control 
action procedure 70. Further, improving the accuracy of the 
fault decision procedure 68 accomplishes a reduction in the 
number of missed alarms thereby accomplishing more 
50 timely alarm or control action by the alarm or control action 
procedure 70. One embodiment of the invention thereby 
enables improved operating safety, improved efficiency and 
performance, and reduced maintenance costs for a wide 
variety of industrial, utility, business, medical, transporta- 
55 tion, financial, and biological processes and apparatuses 
wherein such process and/or apparatus asset 12 has at least 
one characteristic data signal suitable for surveillance. 

In use and operation, FIGS. 1 through 9 show an embodi- 
ment of the system 10 employing the fault detection model 
60 54 contained in the process model 50, the accompanying 
fault detection procedure 66 having the perform ASP tests 
procedure 78, and the fault cause or classification model 55. 
In a typical surveillance procedure, the asset 12 is the source 
of at least one signal 14 that is acquired and digitized using 
65 conventional data acquisition means 20 for providing the 
data acquisition procedure for the purpose of computer- 
assisted surveillance. The digitized signal data is generally 
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evaluated using computer 22 having computer software 
modules implementing the estimate process parameters pro- 
cedure 64, the perform fault detection procedure 66, and the 
fault cause or classification procedure 84. The estimate 
process parameters procedure 64 is used to produce an 5 
estimated signal value for at least one process signal 14 
emanating from the asset 12. The estimate process param- 
eters procedure 64 in general makes use of the process 
model 50 stored in a memory means 24 associated with the 
computer 22 to produce the estimated signal values. The to 
estimated signal values are then generally evaluated using 
the perform fault detection procedure 66 to identify faults or 
operating anomalies associated with the asset 12. When a 
fault is found, the results are thereafter communicated to for 
evaluation to the asset fault cause or classification procedure 15 
84 for determining an asset fault cause or classification to a 
determined indication of an asset fault condition obtained 
when the analysis determines that the results indicate unex- 
pected operation of the asset. Next, and in one embodiment, 
an indication procedure 86 provides an indication correlative 20 
to a determined asset fault cause and a response procedure 
88 provides an action such as an alarm, control and/or a 
communication action for providing asset surveillance. For 
example, when a fault is found and a cause is determined, an 
indication can be provided by, for example, a conventional 25 
communications means 26 (as is known to those having 
ordinary skill in the art, and informed by the present 
disclosure) to the operator console 80 and/or to the asset 
control means or system 82 for possible alarm and/or control 
action. 30 

The computer 22 along with the associated memory 
means 24 can also be employed to perform the training and 
surveillance procedures 30, 60 as delineated hereinabove 
and to produce and store all the desired data associated with 
these procedures, for example, the historical training data, 35 
designer defined settings, and the process model. 

Multivariate State Estimation Technique (MSET) for Esti- 
mate Process Parameters Procedure: 

In one embodiment of the invention, the estimate process 
parameters procedure 64 uses a multivariate state estimation 40 
technique (MSET) procedure having an MSET parameter 
estimation model 52 structure. The US Department of Ener- 
gy’s Argonne National Laboratory originally developed the 
implementation of MSET described herein for surveillance 
of sensors and components in nuclear power plant applica- 45 
tions. However, other implementations of a multivariate 
state estimation technique are possible and useful in con- 
junction with the system 10. MSET is in general a statisti- 
cally derived parameter estimation algorithm that uses 
advanced pattern recognition techniques to measure the 50 
similarity or overlap between signals within a defined 
domain of asset operation (set of asset operating states). 
MSET “learns” patterns among the signals by numerical 
analysis of historical asset operating data. These learned 
patterns or relationships among the signals are then used to 55 
identify the learned state that most closely corresponds with 
a new signal data observation. By quantifying the relation- 
ship between the current and learned states, MSET estimates 
the current expected response of the asset signals. MSET 
then uses threshold comparisons or a form of statistical 60 
hypothesis testing, such as a Sequential Probability Ratio 
Test (SPRT) as disclosed in U.S. Pat. No. 5 , 459,675 which 
is hereby incorporated by reference in its entirety or the 
Adaptive Sequential Probability (ASP) procedure to com- 
pare the current estimated value of a signal with its observed 65 
value. The MSET procedure provides an accurate and 
widely applicable method to estimate the operating signal 


values for an asset. However, other implementations of the 
parameter estimation procedure are possible and useful in 
conjunction with system 10. 

An MSET model is created for the asset 12 using the 
MSET training algorithms to learn the inherent data rela- 
tionships within a set of historical asset operating data. The 
trained MSET model is then used with the MSET parameter 
estimation and fault detection algorithms to perform the 
process surveillance function when presented with a new 
observation of signal data values. The following sections 
will first provide a mathematical overview of the MSET 
algorithms and procedures useful for training a parameter 
estimation model and for using this trained model for 
process surveillance. The description is followed by a 
detailed description of a one embodiment of the invention 
using a novel Adaptive Sequential Probability (ASP) pro- 
cedure for fault detection during process surveillance. 

MSET Training and Parameter Estimation Procedures: 

The MSET methods are generally described in the fol- 
lowing two US Government documents produced and main- 
tained by the US Department of Energy’s Argonne National 
Laboratory, Argonne, 111., disclosure of which is incorpo- 
rated in its entirety herein by reference. 

J. P. Herzog, S. W. Wegerich, R. M. Singer, and K. C. 
Gross, “Theoretical Basis of the Multivariate State Estima- 
tion Technique (MSET),” Argonne National Laboratory, 
ANL-NT-49, December 1997. 

J. P. Herzog, S. W. Wegerich, K. C. Gross, and R. M. 
Singer, “MSET: Code Structure and Interface Development 
Guide,” ANL-NT-48, August 1997. 

The MSET algorithm uses pattern recognition with his- 
torical operating data from an asset to generate a parameter 
estimation model. If data is collected from a process over a: 
range of operating states, this data can be arranged in matrix 
form, where each column vector (a total of m) in the matrix 
represents the measurements made at a particular state. 
Thus, this matrix will have the number of columns equal to 
the number of states at which observations were made and 
the number of rows equal to the number of measurements (a 
total of n signal data values) that were available at each 
observation. We begin by defining the set of measurements 
taken at a given time t^ as an observation vector X(t^.), 

({/)> *2 (tj), X„(tj)] T (El) 

where x z (t^) is the measurement from signal i at time t .. We 
then define the data collection matrix as the process memory 
matrix D: 


<* 1,1 

<* 1,2 ■ 

■ ■ <* 1 ,™ 

<* 2,1 

<* 2,2 ■ 

■ ■ <*2,»j 

< 4,1 

< 4,2 

■ ■ < 4 ,™ 




(E2) 


Each of the column vectors (X(t^.)) in the process memory 
matrix represents an operating state of the asset. Any number 
of observation vectors can be assigned to the process 
memory matrix. Training an MSET model includes collect- 
ing enough unique observation vectors from historical 
operation of the asset during normal conditions such that the 
process memory matrix encompasses the lull dynamic oper- 
ating range of the asset. Computation of the D matrix is the 
first of three steps in the method for training an MSET model 
based on historical operating data. 
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One of at least two algorithms is used by MSET to select 
the vectors in the D matrix. The MinMax algorithm extracts 
vectors that bound the vector space defined by the training 
data and returns the smallest process memory matrix that 
will produce an effective system model (see also U.S. Pat. 5 
No. 5,764,509 which is hereby incorporated by reference in 
its entirety). A vector ordering algorithm selects and 
includes representative vectors from the inner regions of the 
vector space producing a more accurate system model. 

Once the process memory matrix has been constructed, to 
MSET is used to model the dynamic behavior of the system. 

For each current observation of the system (Xobs), MSET 
compares the observation vector to the stored operating 
states to calculate an estimate of the process parameter 
values. The parameter estimate of the current process state 15 
(Xest) is an n-element vector that is given by the product of 
the process memory matrix and a weight vector, W: 

x„ t =mw (E3) 

The weight vector represents a measure of similarity 20 

between the estimate of the current state and the process 
memory matrix. To obtain the weight vector, we minimize 
the error vector, R, where: 

R=X ob -X est (E4) 25 

The error is minimized for a given state when: 

W=(D r ®D) -1 •(D r ®X 0 £ a ) (E5) 

This equation represents a “least squares” minimization 30 

when the pattern recognition operator ®is the matrix dot 
product. Several advanced pattern recognition operators 
have been defined that provide excellent parameter estima- 
tion performance. Pattern recognition operators used by 
MSET include, but are not limited to, the System State 35 
Analyzer (SSA) method (see also U.S. Pat. No. 4,937,763 
which is hereby incorporated by reference in its entirety), the 
Bounded Angle Ratio Test (BART) method (see also U.S. 
Pat. No. 5,987,399 which is hereby incorporated by refer- 
ence in its entirety), the Vector Pattern Recognizer (VPR) 40 
method, the Vector Similarity Evaluation Technique (VSET) 
method, and the Probabilistic State Estimation Method 
(PSEM). 

Once the weight vector is found, the resulting current state 
estimate of the system (i.e., the parameter estimate vector) 45 
is given by: 

x esr *(b T ®by l *(b T ®x ob j (E6) 

The first application of the pattern recognition operator in 
equation E6 (D r <g)D) involves a comparison between the 50 
row vectors in the D r matrix and each of the column vectors 
in the D matrix. If we define G=D r <g>D, then G, the similarity 
matrix, is an m by m matrix. The element in the i-th row and 
j-th column of the matrix (g zV ) represents a measure of the 55 
similarity between the i-th and j-th column vectors (i.e., 
memorized states) in the process memory matrix. The sec- 
ond application of the pattern recognition operator in equa- 
tion E6 (D r <g>X 0&s ) involves a comparison between the row 
vectors in the D T matrix and each of the elements in the 60 
observation vector Xobs. If we define A=D T ®X obs , then A, 
the similarity vector, is an m by 1 vector. Each element in the 
similarity vector is a measure of the similarity between the 
observation vector and the i-th column vector (i.e., memo- 
rized state) in the process memory matrix. 65 

Note that the similarity matrix is a function of the process 
memory matrix only. Thus, the similarity matrix and its 
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inverse Ginv=(D r ®D) -1 can be calculated as soon as the 
process memory matrix has been derived thereby making the 
application of MSET to an on-line surveillance system more 
computationally efficient. Computation of the Ginv matrix 
initializes the parameter estimation model and completes the 
second of three steps in the procedure for training an MSET 
model based on historical operating data. 

The third and final step in the MSET training procedure 
includes analyzing the historical training data using equation 
E4 to produce a residual error vector, R, for each observation 
vector in the training data. The collection of residual error 
vectors comprises the training data residuals necessary for 
training the fault detection model 54 using any one of a 
plurality of techniques, including but not limited to the 
SPRT technique, and the novel ASP technique. 

The Sequential Probability Ratio Test (SPRT) technique is 
a statistical hypothesis test fault detection algorithm histori- 
cally used for MSET process surveillance. The SPRT tech- 
nique is described in U.S. Pat. No. 5,459,675, which is 
incorporated herein by reference in its entirety. The SPRT 
analyzes a sequence of discrete residual error values from a 
signal to determine whether the sequence is consistent with 
normal signal behavior or with some other abnormal behav- 
ior. When the SPRT reaches a decision about the current 
signal behavior, e.g., that the signal is behaving normally or 
abnormally, the decision is reported and the test continues 
analyzing the signal data. For any SPRT, signal behavior is 
defined to be normal when the signal data adheres to a 
Gaussian probability density function (PDF) with mean 0 
and variance a 2 . Normal signal behavior is referred to as the 
null hypothesis, H 0 . MSET employs four specific SPRT 
hypothesis tests. Each test determines whether current signal 
behavior is consistent with the null hypothesis or one of four 
alternative hypotheses. The four tests are known as the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H l5 is 
that the signal data adhere to a Gaussian PDF with mean +M 
and variance cr 2 . For the negative mean test, the correspond- 
ing alternative hypothesis, H 2 , is that the signal data adheres 
to a Gaussian PDF with mean -M and variance a 2 . For the 
nominal variance test, the corresponding alternative hypoth- 
esis, H 3 , is that the signal data adheres to a Gaussian PDF 
with mean 0 and variance Vcr 2 . For the inverse variance test, 
the corresponding alternative hypothesis, H 4 , is that the 
signal data adheres to a Gaussian PDF with mean 0 and 
variance cE/V. The user-assigned constants M and V control 
the sensitivity of the tests. 

Limitations of the SPRT Fault Detector Training and 
Surveillance Method and System: 

One significant shortcoming of the SPRT technique is 
found in the assumptions underlying its mathematical for- 
mulation. Specifically, the SPRT technique presumes that 
the residual error signals adhere to a Gaussian probability 
density function. For residual error signals that are non- 
Gaussian, the fault detector false alarm rates and/or missed 
alarm rates specified by the designer are not accomplished 
by the SPRT procedure thereby degrading the fault decision 
accuracy of the asset control and/or surveillance system. The 
novel ASP technique of one embodiment of the invention 
specifically removes the assumption that the residual error 
signals adhere to a Gaussian probability density function. 
The ASP technique implements any one of a plurality of 
methods to numerically fit a probability density function to 
the observed residual error signal distribution that is char- 
acteristic of normal asset operation. The derived probability 
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density function is then used to perform a dynamic statistical 
hypothesis test thereby achieving the fault detector false 
alarm and missed alarm rates specified by the designer and 
improving the fault decision accuracy of the asset control 
and/or surveillance system. 5 

Fault Detection Using Statistical Hypothesis Test Proce- 
dures: 

The general theory underlying the statistical hypothesis 
test will now be delineated below. Next, the SPRT imple- 
mentation of a dynamic statistical hypothesis test will be to 
described. Finally, the novel ASP implementation of a 
dynamic statistical hypothesis test for non-Gaussian residual 
error signals will be delineated in detail along with a 
delineation of its reduction to practice. 

Bayes’ Rule for a Single Observation: 15 

Statistical decision problems in which there are just two 
possible outcomes constitute an important class called 
binary hypothesis testing problems. The possible states of a 
system are called hypotheses and each individual state of the 
system is termed a simple hypothesis. A simple hypothesis 20 
is a complete specification of a probability distribution for 
the system (i.e., the distribution of possible observations or 
samples from the system). The “hypothesis” being tested is 
that the particular distribution is the correct one. 

The basic operation in a binary hypothesis test is to 25 
evaluate the veracity of a hypothesis, H, given a piece of 
evidence or observation, e, from the system being studied. 
Because of the unpredictability or element of chance inher- 
ent in the system, the test deals with the probabilities that 
events occur or that hypotheses are true. The probability that 30 
a hypothesis is true given a piece of evidence is written as 
P(Hle). The notation identifies a conditional probability — 
namely the probability that the hypothesis is true under the 
condition that the event has occurred with absolute certainty. 
Bayes’ well known inversion rule, as described in Probabi- 35 
listic Reasoning in Intelligent Systems: Networks of Plau- 
sible Inference, by Judea Pearl, Morgan Kaufmann Publish- 
ers, Inc., San Mateo, Calif., 1988, Second Edition at pages 
29 through 39, provides a means for evaluating the condi- 
tional probability of a hypothesis, P(Hle), which is also 40 
known as the posterior probability, 


P(H\e) = 


P(e\H)P(H) 

P(e) 


(E7) 

45 


Bayes’ rule can be simplified by eliminating the denomi- 
nator in equation E7. Combining equations E8 and E9, and 
substituting e for R and H for S, the denominator can be 
written as 

P(e)=P(e\H)P(H)+P(er~lH)P(~lH) (E10) 

Therefore, Bayes’ rule becomes 


T\H\e) = 


P(e\H)P(H) 

P(e\H)P(H) + P(e\ I H)P( I H ) ' 


(Ell) 


The power of Bayesian techniques comes primarily from 
the fact that in causal reasoning the relationship P(elH) is 
local, namely, given that H is true, the probability of e can 
be estimated naturally and is not dependent on many other 
propositions. For instance, given that the measurements 
from an experiment adhere to a particular PDF, the prob- 
ability that any single measurement will occur is easily 
computed. 

The complementary form of Bayes’ rule provides the 
posterior probability for the converse of the hypothesis. It is 
evaluated by substituting “~|H for each instance of H in 
equation Ell and noting that “ |(^HH)=H: 


PC^H\e) 


P{e | I H)P{ I H) 

P{e | H)P(H) + P{e | I H)P{ I H ) ' 


(El 2) 


An alternate form of Bayes’ rule is produced by dividing 
it (i.e., equation Ell) with its complementary form (i.e., 
equation El 2) to obtain 


P(H \e) _ P(e\H)P(H) (El 3) 

P( \H\e) ~ P{e | I H)P(H)P( I H ) ' 


This form of Bayes’ rule is further manipulated by first 
defining the prior odds on hypothesis H as 


P(elH) is the probability that the observation would occur 
if the hypothesis is true. P(H) is the probability that the 
hypothesis is true before any observations of the system 
have been made, also known as the prior probability. The 
denominator, P(e), is the unconditional probability that the 
observation occurs. 

Basic axioms of set theory can be used to prove the 
following identity for two events R and S: R=(RnS)U 
(RD — IS), where “IS is the converse of event S. In 
probability theory, the analog of this identity is 

P(R)=P(R,S)+P(R, I S), (E8) 


0{H) = 


P(H ) 

P( I#)’ 


the likelihood ratio as 


Ue\ H) = 


P(e\H ) 

P{e I I H) ’ 


and the posterior odds on H as 


(El 4) 


(El 5) 


where the notation P(R,S) is used represent the probabil- 
ity of the joint event RPlS. The multiplication law states that 
the probability of two events occurring jointly can be 
expressed as a function of the conditional probability of one 
event based on the other, 

P(R,S)=P(R I S)P(R) (E9) 

If R and S are independent events, then P(RIS)=P(R) and 
the multiplication law simplifies to P(R,S)=P(R)P(S). 


60 


0(H\e) = 


P(H\e) 

P{ 17/ | e) ' 


(El 6) 


Bayes’ rule then specifies the posterior odds as the 
65 product of the likelihood ratio and the prior odds, 


0{H\e)=L(e\H)0(H). 


(El 7) 
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Bayes’ Rule for a Time Series: 

The formulation above specifies Bayes’ rule for a single 
observation and a binary hypothesis. For the application of 
binary hypothesis tests to real world signals, the formulation 
must be able to handle a sequence of discrete observations. 5 
This is accomplished by beginning with a single observation 
and successively updating Bayes’ rule for each successive 
observation. Let the sequence {Y„} be an ordered set of n 
elements, {Y w }=y 1 , y 2 , . . . , y„, in which the elements are 
observations of the signal made at n discrete moments in 
time such that t x <t 2 < . . . <t w . Bayes’ rule for the first 
observation (yj in the time series is 


P(H\ yi ) = 


P( yi | H)P(H) 

P(yi) 


(El 8) 


Adding the second observation from the time series, 
Bayes’ rule for the joint event y 1 Ply 2 is 


-continued 


n 

= /wf[ 

i=l 


Pjyi I H) 
P(yi ) 


If an explicit time dependency can be established for the 
observations in the time series (i.e., a function is found that 
relates earlier events to later events), then the general form 
of Bayes’ rule (equation E21) can be used to develop failure 
detection models for serially-correlated signals. However, 
the residual signals formed by the difference between the 
observed and estimated signal values are in general random 
signals; thus, Bayes’ rule for random time series is used as 
the basis for the fault detection models. 

Dividing by the complementary form of Bayes’ rule for 
random time series and utilizing the definition of the pos- 
terior odds, prior odds, and likelihood ratio from above, an 
alternate form of Bayes’ rule for a time series is developed: 


P(H\yi,y 2 ) = 


P(yi, y 2 1 H)P(H) 
P(yu yi) 


(El 9) 
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0(H |yi,^ 2 y n ) = 0(H)\\ Uy, \ H). 


(E23) 


With the aid of the multiplication law (equation E9), the 
joint event probabilities are converted to conditional prob- 
abilities so that the right hand side of this equation can be 
rewritten as 30 


x P(y 2 \y l ,H)P(y l \H)P(H) (E20) 

P(H\yi, y2 ) = ^ — j — — : • 

P(y2 1 vi)^(vi) 

35 


Note that the probability of the joint event yiftyj is 
written as P(y l5 y 2 )=P(y 2 ly 1 )P(y 1 ) instead of the equivalent 
p(yi, y 2 )=P(yily 2 )P(y 2 ) because of the temporal dependency 
of the data. The second form of the multiplication law 4Q 
reduces to P(y l5 y 2 ) = P(yi) P(y 2 ) because earlier events (e.g., 
y : ) in a time series cannot be dependent on later events (e.g., 

y 2 )- 

The multiplication law is used for each successive obser- 
vation in the time series to derive the form of Bayes’ rule for 45 
the joint event (yiriy 2 n . . . Hy„): 

P(H\y u y 2 ,... ,y n )= (E21) 

P(y n I y n -i , • • • ... Pjy 2 \ yi , H)P(yi \ H)P{H) 50 

P{y n I .y«-i , ■ ■ ■ ,y0--- P(y 2 1 yOEfyi) 


Since the probabilities for each of the later observations in 
the time series are conditioned on earlier observations, 55 
Bayes’ rule is difficult to solve in the general case. But if the 
observations in the series are independent of each other (i.e., 
random), then the probabilities will be dependent on the 
hypothesis only. In this case the conditional probability for 
the ith observation, P(y 1 -ly f _ 1 , . . . , y 1? H), is simply P(y z IH). 60 
Thus, Bayes’ rule for independent observations in a time 
series is 


P(H\ yi ,y 2 ,... ,y n ) = 


P(y n \H) ... P( yi \H)P(H) 

P(y n ) - P(yi ) 


(E22) 65 


If we take the logarithm of this equation, the incremental 
nature of Bayesian formulation becomes more apparent. 
Equation E24 shows the log of the likelihood ratio as a 
weight, carried by each observation in the sequence, which 
additively sways the belief in the hypothesis one way or the 
other. 


A (E24) 

In [0(H | y u y 2 , ... , y n )] = In [0(H)] + [[ ln[L(^ | H)] 


Sequential Hypothesis Tests: 

Wald first presented and studied the following sequential 
test of a simple hypothesis against a simple alternative, as 
described in Sequential Analysis, by A. Wald, John Wiley 
and Sons, Inc., New York, 1947. Let H 0 be a specific 
probability density function called the null hypothesis. Then 
the probability that the time series {Y w } contains samples 
drawn from H 0 is P(y l5 y 2 , . . . , yJH 0 ). Let H, be a different 
probability density function called the alternative hypoth- 
esis. Then the probability that the time series {Y„} contains 
samples drawn from H,- is P(y l5 y 2 , . . . , yJH^). Two 
threshold limits A and B are chosen, with A<B, and after 
each observation in the series the following statistic (A^) is 
calculated: 


^ P( yi ,y 2 ,... , y n I Hj) 
Jn P(yi,y , ■■■ ,y n l^o) ' 


(E25) 


The test procedure is then as follows. If the statistic is 
greater than or equal to the upper threshold limit (i.e., 
A^i^B), then a decision is made to accept hypothesis H^ as 
true. If the statistic is less than or equal to the lower 
threshold limit (i.e., Aj^A), then a decision is made to 
accept hypothesis H 0 as true. If the statistic falls between the 
two limits (i.e., A<Aj s „<B), then neither hypothesis can yet 
be accepted to be true and sampling continues. If the 
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observations in the series are independent random variables, 
then the test statistic reduces to a product of conditional 
probabilities: 


o P(yi I Hj) 

jn 


(E26) 


In the sequential hypothesis tests, the logarithm of the test 
statistic is often easier to work with: 





P{yi I Hj) 
P(y> I H 0 ) 


(E27) 


Assuming the observations in the time series are inde- 
pendent, then Bayes’ rule conditioned on hypothesis H^. (i.e., 
equation E22 with the symbol H replaced by H.) can be 
divided by Bayes’ rule conditioned on hypothesis H 0 to 
5 produce Bayes’ rule for a sequential hypothesis test: 


10 


n 

Oj(H 0 |^,y 2 y n ) = Lj(y, \ H 0 ). 


(E31) 


Dividing this equation through by the prior odds, it 
becomes apparent that the statistic in the sequential hypoth- 
esis test is just the ratio of the posterior odds to the prior 
15 odds: 


The sequential hypothesis test consists of calculating the 
logarithm of the test statistic for each observation in the 
series and comparing the result to the logarithms of the 20 
lower and upper threshold limits. 

The statistic in the sequential hypothesis test is a product 
of a sequence of likelihood ratios. Each term in the product 
is the ratio of a probability conditioned on one hypothesis to 
a probability conditioned on a second hypothesis. The 
difference between the likelihood ratios in the sequential 
hypothesis tests and those in Bayes’ rule (see equations El 5 
and E23), is that in Bayes’ rule the probabilities are condi- 
tioned on a hypothesis and its converse, whereas in the 
sequential hypothesis tests the probabilities are conditioned 
on two hypotheses from a set of related hypotheses. In 
principle, the two hypotheses in the sequential hypothesis 
tests could be the converse of each other (i.e., the set 
contains two elements), but in practice the hypotheses are 35 
selected from an infinite set of exhaustive and mutually 
exclusive hypotheses. For instance, suppose the null hypoth- 
esis is a Gaussian PDF with a mean of 1 and a variance 10. 
Then a sequential hypothesis test can be defined in which the 
alternate hypothesis is any Gaussian PDF in which the mean ^ 
is not 1 and/or the variance is not 10. 

It is informative to compare the sequential hypothesis 
tests to Bayes’ rule for a time series. Given a null hypothesis 
H 0 and an alternative hypothesis H^., the likelihood ratio for 
an observation e conditioned on the two hypotheses is 45 
defined as 


Lj(e | H 0 ) = 


Pie | Hj) 
Pie | H 0 ) m 


(E28)) 
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The subscripts emphasize the fact that the two hypotheses 
are selected from an infinite set of related hypotheses. The 
prior odds for the two hypotheses are defined as 


0j{H o ) ■■ 


P{Hj) 

p{H 0 y 


(E29) 
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while the posterior odds are defined as 


0j{H o | e) = 


P(Hj | e) 
P(H 0 | e)' 


(E30) 
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0j{H o \y u y 2 , ... ,y„) 
0j(H o ) 


H A/O'; I «.) - A j.n- 


(E32) 


The prior odds are the ratio of the probability that the 
alternative hypothesis is true to the probability that the null 
hypothesis is true, before any data have been collected from 
the system. In many cases, these probabilities are not 
known — no prior information about the system is known. In 
the absence of prior information about the system, these 
probabilities are taken to be ¥ 2 , making the prior odds equal 
to 1. Thus in the absence of prior information, the test 
statistic \j n is equal to the odds that the system adheres to 
the alternative PDF as opposed to the null PDF. 

Error Probabilities for the Sequential Hypothesis Tests: 

Because the sequential hypothesis tests make decisions 
based on probabilities, there is always a finite probability 
that a decision reached by the test is erroneous. If a sequen- 
tial hypothesis test makes a decision to accept the alternative 
hypothesis H, as true when the null hypothesis H 0 is true, 
then an error of type I is said to have occurred. If a sequential 
hypothesis test makes a decision to accept the null hypoth- 
esis H 0 as true when the alternative hypothesis H, is true, 
then an error of type II is said to have occurred. Although the 
designation is arbitrary, it stems from situations in which one 
kind of error is more serious than the other. Since the normal 
usage of the sequential hypothesis tests is to detect a change 
in signal response from its normal behavior (i.e., hypothesis 
H 0 ) to some abnormal behavior (i.e., hypothesis H^), the 
error of accepting H, when H 0 is true is the more serious 
error. The probability that a decision to accept hypothesis 
is erroneous is denoted by a. A type I decision error is also 
called a false alarm and the probability of a type I error is 
called the false alarm probability. The probability that a 
decision to accept hypothesis H 0 is erroneous is denoted by 
p. A type II decision error is also called a missed alarm and 
the probability of a type II error is called the missed alarm 
probability. 

The sequential hypothesis tests are open-ended. The tests 
will continue to collect observations from the system and 
update the test statistic until the test statistic satisfies one of 
the two decision conditions. In principle, the number of 
observations needed to reach a decision can be any positive 
integer, although it can be shown that a decision will be 
reached in a finite number of observations. Since the number 
of observations needed to make a decision is indeterminate, 
the probability that a decision is erroneous is found by 
summing the probability of an erroneous decision being 
made after 1 observation, 2 observations, and so on. For- 
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mally, in terns of the threshold limits A and B that define the 
test, the false alarm probability is given by: 

a=P{Aj^B\H 0 )+P{B<A ja <A, Aj 2 =B\H 0 )+ (E33) 

The first term in the sum is the probability that the test 5 
statistic drops below the lower threshold limit after only one 
observation given that the alternative hypothesis is true. The 
second term is the probability that the test statistic drops 
below the lower threshold limit after two observations given 
that the alternative hypothesis is true. Similarly, the missed 10 
alarm probability is given by: 

^=P(A J ^A\H J ) + P(B<A ja <A, Aj2<A\Hj)+ (E34) 

These expressions are by no means easily computed. 
Moreover, one could not hope to solve these equations for A 15 
and B in terms of given a and p, despite the desirability of 
being able to do so in setting up a test to provide a specified 
protection. Although these equations cannot be solved, it can 
be shown that the error probabilities and the threshold limits 
are related by the following inequalities 20 


a 1/3 (E35) 

— — and — > — 


The error probabilities and the threshold limits are related 
by inequalities because the test statistic A does not usually 
attain exactly the value A or the value B when the test is 
completed. But since a decision is declared as soon as an 
observation drives the test statistic past either threshold, the 
inequalities are almost equalities. Indeed, in practice, A and 
B are taken to be equal to a/(l-p) and (l-a)/p, respectively. 
Doing so, of course, means that the sequential hypothesis 
test actually carried out has error probabilities that are 
somewhat different than those specified. Let a’ and p’ denote 
the empirical error probabilities actually attained by a test 
using specified threshold limits of A=a/(l-p) and B=(l-a)/ 
p. Then according to the inequalities in equation E35, the 
empirical error probabilities (i.e., a' and p') are related to the 
preassigned error probabilities (i.e., the values of a and p 
used to specify the threshold limits) by 


a' a ft 1 f3 

< A = and < — = . 

(l~ft)~ (1-0 (l-a')-B (1 — or) 


(E36) 
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Multiplying these through to eliminate denominators, one 
obtains 

50 

a'-a'fJ^a-afV and fV-aA'^p-a'p. (E37) 

Adding these two equations together, one obtains an 
inequality relating the empirical error probabilities to the 
preassigned error probabilities: 

a'+P’^a+p. (E38) 55 

One of the key features of the hypothesis test technique is 
that the designer can specify the error probabilities. This is 
particularly important for type I errors, because false alarms 
can cause an operator to make an incorrect decision. Type II 60 
errors typically do not lead to incorrect decisions. This is 
because in the event that a real failure does occur, missed 
alarms may delay the time to detect the failure but not the 
ability to detect the failure. The result above (equation E38) 
shows that the preassigned false alarm probability is not a 65 
strict upper limit for the empirical false alarm probability. 
Similarly, the preassigned missed alarm probability is not a 


strict upper limit for the empirical missed alarm probability, 
even when the hypothesis test is applied to purely random 
data. It is the sum of the preassigned error probabilities that 
is an upper limit for the sum of the empirical error prob- 
abilities. Thus it is possible with purely random data for one 
of the empirical error probabilities to exceed its correspond- 
ing preassigned error probability, but both empirical error 
probabilities cannot be greater than their corresponding 
preassigned error probabilities. 

True upper bounds for the empirical error probabilities 
can be determined from the inequalities in equation E36, 


1 ( 1 - 0 ) “( 1-0 


and ft < 


P . P 

(1 - o') ~ (1- a) ‘ 


(E39) 


For small preassigned error probabilities, the true upper 
bounds are only slightly greater than the preassigned error 
probabilities. For instance, if both of the preassigned error 
probabilities are 0.01 then the empirical error probabilities 
of the test will not exceed 0.0101. 

The Sequential Probability Ratio Test (SPRT): 

The sequential hypothesis tests described herein above are 
general statistical tests valid for any pair of related hypoth- 
eses. MSET has historically employed four specific sequen- 
tial hypothesis tests to detect signal faults. These four tests 
are called the Sequential Probability Ratio Tests, or SPRTs. 
The SPRTs monitor for changes in the statistical character- 
istics of the residual signals. A residual signal is the differ- 
ence between an actual signal and MSET’s estimate of that 
signal. The SPRTs continually monitor a residual signal, 
generating sequences of decisions. A decision in which the 
null hypothesis is accepted (i.e., A^ „=A) is called a normal 
decision and implies that the residual signal is behaving as 
anticipated. A decision in which the alternative hypothesis is 
accepted (i.e., A y W ^B) is called a fault decision and implies 
that the residual signal is behaving abnormally. 

The null hypothesis upon which the SPRTs are based 
specifies that the residual signal consists of Gaussian data 
that have a sample mean of 0 and a sample variance of cr 2 . 
A training procedure, during which the system is operating 
normally, is used to verify that the mean of the signal is 0 and 
to evaluate the variance of the signal. Note that if the 
residual signal does not have a mean of 0, the calculated 
mean from the training phase is used to normalize the 
residual signal for the surveillance procedure using the 
model. Thus the null hypothesis, H 0 , for the SPRTs is that 
the signal being analyzed adheres to Gaussian PDF (N(y; \ 1 , 

a 2 )), 


N(y\ ft, a 2 ) = - 




exp 


2 cr 2 


(y-pY 


(E40) 


for which the mean (p) is 0 and the variance is a 2 . Thus, 
the conditional probability that a discrete observation (y z ) 
occurs given the null hypothesis is expressed as 

B(yfti 0 )=N(y i ; 0, a 2 ). (E41) 

The four SPRT tests historically used with MSET are the 
positive mean test, the negative mean test, the nominal 
variance test, and the inverse variance test. For the positive 
mean test, the corresponding alternative hypothesis, H 1; is 
that the signal data adhere to a Gaussian PDF with mean +M 
and variance cr 2 , where M is the preassigned system distur- 
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bance magnitude for the mean test. For the negative mean 
test, the corresponding alternative hypothesis, H 2 , is that the 
signal data adheres to a Gaussian PDF with mean -M and 
variance a 2 . Referring to FIG. 10, the hypotheses for the two 
mean tests are illustrated for M=2 and a 2 = 1 . For the nominal 5 
variance test, the alternative hypothesis, H 3 , is that the signal 
data adheres to a Gaussian PDF with mean 0 and variance 
Vcr 2 , where V is the preassigned system disturbance mag- 
nitude for the variance test. For the inverse variance test, the 
corresponding alternative hypothesis, H 4 , is that the signal 10 
data adheres to a Gaussian PDF with mean 0 and variance 
a 2 /V. Referring to FIG. 11, the hypotheses for the two 
variance tests are illustrated for V=2 and o 2 = 1. The condi- 
tional probabilities that a discrete observation (y z ) occurs 
given one of the four alternative hypotheses are expressed as 15 

P{yi I Hi) = N(yr, M, a 2 ), 

P(y> I H 2 ) = N(yr, -M, a 2 ), 

P(yi I h 3 ) = N(yr, o, Vo- 2 ), 

P(y> I H 4 ) = 0, y | 

The logarithm of the test statistic for the four SPRT tests 
can be evaluated by substituting the conditional probabilities 
from equations E41 and E42 into the general formula 
(equation E27) and simplifying. Thus, for the positive mean 
test, 


(E 42 ) 

20 

25 


lnAi,« = 


(E 43 ) 


Z i \N(yr, M,^)' 
ln N(yr, 0 , cr 2 ) 



In 


expf-U^-M) 2 ) 



35 


Z yf - (yi - M ) 2 
2a- 2 

i=i 
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The equation E43 defines the SPRT index for the positive 
mean test (SPRT^J. A SPRT index is defined for each of the 45 
four sequential hypothesis tests. The SPRT indices are the 
actual quantities computed by the SPRT fault detection 
procedure. The SPRT index for the negative mean test 
(SPRT„ eg ) is given by 


lnA 2 ,n = 


Z rWto; M, cr 2 )' 
11 N(yr, 0 , o- 2 ) 


M 1 / M \ 

yZh-y) 5 "* 


(E 44 ) 
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The SPRT index for the nominal variance test (SPRT wom ) 
is given by 


lnA 3 ,« = 


Z \ N( yi -0, Vcr 2 ) 
n N(yr, 0 , o- 2 ) 



1 1 

yf )] 

|> 

V htVcr 2 P I 

2W 2 | 

1 1 
—== exp 
V2mx 2 

L 2cr 2 J 


(E 45 ) 
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-continued 




V- 1 


V V?- 


2 Vcr 2 Z-J ■ 


-InV = SPRT„ n 


The SPRT index for the inverse variance test (SPRT IWV ) is 
given by 


lnA.4 ; „ = 



In 


\ o - 2 ) 

N [y- o- t 

N{yi\ 0 , cr 2 


(E 46 ) 
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Novel Improvements to the Fault Detector Training and 
Surveillance Procedures: 

Having described the SPRT fault detector training and 
surveillance methods herein above, the section below 
describes the novel improvements made by one embodiment 
of the invention when used for fault detector training and 
surveillance, said improvements being applicable to any 
asset preferably having at least one characteristic data signal. 

The Adaptive Sequential Probability (ASP) Test: 

The SPRT technique is valid only for residual signals that 
are random and adhere to Gaussian distributions. Because 
the MSET parameter estimation algorithm simulates the 
deterministic components of a set of inter-related raw sig- 
nals, the residual signals that are produced within MSET are 
often white-noise processes (i.e., not serially-correlated). 
Although the MSET residual signals are typically white, 
they are often non-Gaussian (not normally distributed). If 
the SPRT fault detection model is applied to signals that are 
either non-Gaussian, non- white, or both, the theoretical false 
and missed alarm probabilities may not be achieved. The 
novel advanced fault detection method of one embodiment 
of the invention, herein named the Adaptive Sequential 
Probability (ASP) technique or method, solves the problem 
associated with non-Gaussian, non- white residual signals. 
The ASP method uses binary hypothesis tests that are 
numerically tuned to better accommodate the non-Gaussian 
data distributions that are typical for residual error signals 
produced by parameter estimation based techniques, such as 
MSET. 

Mathematical Foundations of the ASP Test: 

The Adaptive Sequential Probability (ASP) method 
defines four new sequential hypothesis tests. The Adaptive 
Sequential Probability method is an advanced failure detec- 
tion technique that broadens the domain of applicability of 
the SPRT technique to non-Gaussian PDFs. In the ASP 
method, the assumption that the data fit a Gaussian PDF is 
relaxed and the test statistic is evaluated for any arbitrary 
data distribution. In the ASP method, the signal is assumed 
to consist of random observations that adhere to a specific 
PDF that is a function of the sample mean, variance, and 
possibly higher order terms. The PDF is denoted by the 
general function S(y; p, cr 2 , . . . ). The parameter list of the 
function is open-ended to indicate that additional terms, 
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such as the sample skewness, kurtosis, or width of the The ASP index for the negative mean test (ASP„ e ^) is 

distribution at half-maximum, may be required to charac- given by 
terize the function. 


The null hypothesis upon which the ASP tests are based 5 
specifies that the data distribution has a sample mean of 0 
and a sample variance of cr 2 . A training phase, during which 
the system is operating normally, is used to verify that the 
mean of the signal is 0 and to evaluate the sample variance. 

If the PDF is dependent upon any other additional terms, 
they are also evaluated during the training phase for numeri- 
cally tuning one or more probability functions obtained by 
fitting an open-ended general probability functions to one or 
more data distributions obtained during typical or normal 15 
operating conditions of the asset under surveillance. Thus, 
the conditional probability that a discrete observation (y z ) 
occurs given the null hypothesis is expressed as 

P(y,\H 0 )=Z(y,; 0, o 2 , . . . ). (E47) 20 


tnA ^ = A‘1 


= ASP ne 


The ASP index for the nominal variance test (ASP 
given by 


(E50) 


nom ) i S 


cr 2 , ...) 


ASP nom ■ 


(E51) 


The ASP index for the inverse variance test (ASP ZMV ) is 
given by 


Typically, four sequential hypothesis tests (i.e., a positive 
mean test, a negative mean test, a nominal variance test, and 
a inverse variance test) are utilized by the ASP method. For 
the positive mean test, the corresponding alternative hypoth- 
esis, H l5 is that the signal data adhere to the specified PDF 
with mean +M and variance a 2 where M is a preassigned 
system disturbance magnitude for the mean test. For the 
negative mean test, the corresponding alternative hypoth- 30 
esis, H 2 , is that the signal data adheres to the specified PDF 
with mean -M and variance a 2 . For the nominal variance 
test, the corresponding alternative hypothesis, H 3 , is that the 
signal data adhere to the specified PDF with mean 0 and 
variance Va 2 where V is a preassigned system disturbance 35 
magnitude for the variance test. For the inverse variance test, 
the corresponding alternative hypothesis, H 4 , is that the 
signal data adhere to the specified PDF with mean 0 and 
variance o^/V. The conditional probabilities that a discrete 40 
observation (y z ) occurs given one of the four alternative 
hypotheses are expressed as 

P(y i \M l ) = 3(yr,M,cr 2 ,... ), 

P(y i \H 2 ) = 3(y i ;-M,o- 2 ,... ), 

P(yi\H 3 ) = 3(yr,0, Vcr 2 , ... ), 

P(y> I H 4 ) = '3 ^/;0, y, ... j 


(E48) 45 
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The logarithm of the test statistic for the four ASP tests 
can be evaluated by substituting the conditional probabilities 55 
from equations E47 and E48 into the general formula 
(equation E27). The logarithm of the test statistic for a given 
test is defined to be the ASP index for that test. The ASP 
index for the positive mean test (ASP /JO J is given by 


InA 


n 

-Z' 


3 (.y/;M,£r 2 , ...) 


3 0, cr 2 , ...) 


(E49) 
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3 (yi\ 0, cr 2 , ...) 


= ASP in 


(E52) 


The ASP tests are then implemented in the same manner 
as the SPRT tests. Namely, for each time step in the 
calculation, the four ASP indices are calculated (equations 
E49 through E52). Each ASP index is compared to the upper 
and lower log-thresholds and the status of the test is evalu- 
ated (i.e., faulted, normal, or continuing). 

ASP Method for Near-Gaussian Distributions: 

The Adaptive Sequential Probability method consists of 
four specific sequential hypothesis tests applied to non- 
Gaussian data distributions. In order to use the method, a 
general PDF, £$(y; p, cr 2 , . . . ), must be defined for the target 
signal. In this section, the ASP method is derived for data 
distributions that are nearly Gaussian. 

In applications of MSET to the Space Shuttle Main 
Engine accelerometer signals, applicants discovered that the 
residual signals produced by the system model have a nearly, 
but not truly, Gaussian behavior, as described in Dynamics 
Sensor Data Validation Phase I Final Report, by Randall L. 
Bickford and James P. Herzog, NASA Contract NAS 8- 
40874, Jul. 1, 1997. When plotted as a histogram, the 
residual data will appear to have the same bell-curve shape 
as a Gaussian distribution. But when a Gaussian PDF of the 
same mean, standard deviation, and area as the data distri- 
bution is superimposed on it, the histogram was found by 
applicants to be non-Gaussian. Typically, applicants found 
that the histogram has thicker tails than the Gaussian curve, 
which corresponds to a sample kurtosis that is greater than 
3. 

A PDF can be written as a sum of a standard normal 
distribution, Z(x), and a remainder term, R(x), as described 
in Mathematical Methods of Statistics, by H. Cramer, Prin- 
ceton University Press, Princeton, N.J., 1946. 

3(jc)=Z(jt)+i?(jt), (E53) 

where the standard normal PDF 


Z(x) 



(E54) 
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is a function of the dimensionless variable x=(y-p)/cr. The 
standard normal PDF is related to the general Gaussian PDF 
(equation E40) through the normalization condition: 


N(y ; fi, o 2 )dy = 1. 


The remainder term can be expressed as an infinite series 
of orthogonal polynomials whose terms are functions of 
derivatives of the standard normal PDF. The constant coef- 
ficients of the polynomial terms are dependent on the central 
moments of the target distribution. As described in Hand- 
book of Mathematical Functions, by M. Abramowitz and I. 
A. Stegun, Dover Publications, Inc., New York, 1972, the 
first four terms in the series expansion of the remainder 
function are: 


^10(jc)=x 1o -45x 8 +630x 6 -3150x 4 +4725x 2 -945 ? 

^ 11 (x)=x 11 -55x 9 +990x 7 -6930x 5 +17325x 3 -10395x, 

He l2 { jc)=x 12 -66x 10 +1485 x 8 - 13 860x 6 +5 1 975x 4 - 

62370x 2 +10395. (E59) 

Each term in the series expansion of the remainder 
function R(x) is more complex than the previous term as 
higher central moments of the data distribution and higher 
derivatives of the standard normal PDF are introduced. The 
nth term in the remainder function depends on the central 
moments from order 1 (i.e., mean) through order n+2. Thus, 
the first term depends on moments through the third central 
moment (i.e., skewness), the second term depends on 
moments through the fourth central moment (i.e., kurtosis), 
and so on. 

If the data distribution that is to be approximated with 
equation E53 is nearly Gaussian, a good approximation will 


»■" ■ * [i(5 - 5)z“»* - 

[i($ - 15^ 


The p. factors are central moments of the discrete data be achieved with only a few of the terms in the series. In the 


sequence {Y„}=(y 1; y 2 , , y„), 


jYi with H = yj and a a 2 = O'; ~/<) 2 


The Z (n, (x) functions are derivatives of the standard 
normal PDF. The nth derivative of the standard normal PDF 
is given by: 


&\x) = = (-1 THe n {x)Z{x), 


ASP method for near-Gaussian distributions, the data dis- 
tribution is first approximated with the standard normal PDF, 
35 Z(x). The approximation is refined by adding successively 
higher terms of the remainder function until an adequate fit 
between the general PDF, S(x), and the data distribution is 
produced. In practice it is usually not advisable to go beyond 

. the fourth term in the remainder function because the tail 

40 

regions of the general PDF become unstable. 

Using the equations for the derivatives of the standard 
normal PDF (equations E58 and E59), the remainder func- 
tion can be written as a product of the standard normal PDF 
45 and a polynomial whose coefficients depend on the central 
moments of the data distribution. 


where He„(x) are the Hermite polynomials. The first 50 A 

twelve Hermite polynomials are ~ 

He 1 (x)=x, 


He 2 (x)=x 2 -1, 


where the first four terms in the series are 


He 3 (x)=x 3 -3x, 


He 4 (x)=x 4 - 6x 2 +3 , 


He 5 (x)=x 5 - 1 0x 3 + 1 5x, 


He 6 (x)=x 6 -15x 4 +45x 2 -l5, 


He 1 (x)=x 1 - 21x 5 +l 05* 3 - 1 05x, 


77e 8 (jc)=Jc 8 -28jc 6 +2 10^-420^+1 05 , 


He g (x)=x 9 -3 6x 7 +3 78x- 1 2 60x 3 +945x, 


n{ x)=-‘- H ^x), 


^w4(7- 3 K w 4(^) ! "'« w ’ 


r3W= 4S- 10 ^K w+ 
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Thus, the series expansion formula for the approximation 
of the general PDF is 


10 


(x) * Z(x) + Z(x)^ n(x). 


(E65) 15 


Application of the ASP Method to Space Shuttle Main 
Engine Residual Signals: 

In one application of use and operation, and in one 
embodiment of the invention, the ASP technique was 
reduced to practice to provide highly reliable data validation 
for high frequency Space Shuttle Main Engine (SSME) 2 s 
accelerometers. Each SSME is monitored by eight high 
frequency (10.24 kHz) accelerometers with three sensors 
mounted on each of the two high pressure turbopumps and 
two sensors mounted on the engine gimbal bearing. Several 
MSET models were prepared to validate the ASP method for 30 
the six turbopump accelerometers. The MSET pattern rec- 
ognition algorithm produces extremely accurate simulations 
of turbopump accelerometer behavior. Referring to FIG. 12, 
some results from applying the six sensor MSET model to 
a typical Space Shuttle flight (i.e., flight STS057, engine 1) 35 
are shown. For the calculation shown in FIG. 12, the MSET 
training algorithm extracted a small fraction of the data in 
the accelerometer signals to simulate the behavior of the 
accelerometers (the process memory matrix contained only 
0.0083% of the raw signal data). In the top plot, the highly 
dynamic signal from the first SSME turbopump accelerom- 
eter in the model (i.e., the oxidizer prebumer pump 45 
accelerometer) is shown as a function of time from the 
launch of the Space Shuttle. In the middle plot, MSET’s 45 
estimate of the accelerometer signal shows the very high 
accuracy of the MSET simulation. The residual signal for the 
calculation, which is the difference between the sensor 
signal and MSET’s estimate of that signal, is shown in the 
bottom plot. The relative error in the calculation, as mea- 50 
sured by the ratio of the standard deviation of the residual 
signal to the standard deviation of the sensor signal, is only 
0.86%. The results shown in FIG. 12 illustrate MSET’s 
ability to accurately model highly dynamic signals, even 
when only a tiny fraction of available data is used to 55 
represent the signal behavior. 

Still referring to FIG. 12, the residual signal for the 
analysis of a typical SSME accelerometer appears to be 
random. Referring to FIG. 13, the distribution of data in the 6Q 
residual signal is shown. The probability density for the 
residual signal was approximated by dividing the data range 
into m equally-sized bins and then counting the number of 
residual signal data values that fall into each bin. The count 
number is a discrete function of the sequence {Y m }=y l9 65 
y 2 , • • • , y m , where the ith element in the sequence (y z ) is 
given by 


. (^max ^min) (E66) 

yi ymin * 5 


and y min is the minimum datum and y max is the maximum 
datum in the residual signal. A normalized residual signal 
PDF is produced by dividing each element in the count 
number function by the total number of elements in the 
residual signal. The number of bins used to produce the 
residual signal PDFs was 1000. As shown in FIG. 13, a PDF 
for a typical residual signal has a nearly Gaussian shape. 
Superimposed on top of residual signal PDF is a Gaussian 
PDF of the same mean, standard deviation, and area as the 
residual signal PDF. The first four moments of the residual 
signal data are as follows: mean=l .1*1 0 -4 g, standard devia- 
tion=8.2*10 -2 g, skewness=0.32, and kurtosis=5.7, where 
the skewness and kurtosis are related to the third and fourth 
central moments of the data by 


skewness = ^ and kurtosis = ^ . 

cr 5 cr 4 


(E67) 


Because the kurtosis of the distribution is greater than 3, 
the residual signal PDF has thicker tail regions than a true 
Gaussian distribution. This is confirmed by a visual exami- 
nation of FIG. 13. 

The ASP method for near-Gaussian distributions was 
applied to the residual signals from the MSET model of the 
SSME accelerometers. The residual signal PDFs were 
approximated with the series expansion formula of a general 
PDF (equations E60 through E65). Four calculations were 
performed for each residual signal in the model. In the first 
calculation, the residual signals were approximated with the 
one-term series expansion formula. In the one-term formula, 
the remainder function (R(x)) is given by the product of the 
standard normal PDF and the first term in the series, r 1 (x). 
Subsequent calculations introduced additional terms from 
the series, culminating with the four-term formula in which 
the remainder function is given by the product of the 
standard normal PDF and the four term series, r 1 (x)+r 2 (x)+ 
r 3 ( x )+r 4 (x). 

For each calculation, the quality of the approximation is 
measured by the root mean- squared (ims) error of the 
calculation. The rms error (E rm J is a function of the differ- 
ence between the calculated PDF (£s(x)) from equation E65 
and the residual signal PDF (F(x)), 


-Y (3 (Xi)-F(Xi)) 2 

m4-4 


(E68) 


The calculated and residual signal PDFs are discrete 
functions of the dimensionless variable x.=(y.-p)/a, where 
y is an element in the sequence that spans the range of data 
in the residual signal distribution (see equation E66). 

In general terms, each additional term in the series expan- 
sion improves the approximation of the residual signal 
PDFs. Referring to FIG. 14, FIG. 15 and FIG. 16, adding 
terms from the series expansion formula improves the fit of 
the residual signal PDF from the first sensor. The approxi- 
mation of the residual signal PDF generated by the one-term 
series expansion formula is compared to a Gaussian approxi- 
mation and the residual signal PDF in FIG. 14. The one-term 
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approximation shows little improvement over the Gaussian 
approximation. The rms error for the Gaussian PDF is 0.414, 
while the rms error for the one-term series expansion PDF 
is slightly smaller at 0.409. The PDF generated by the 
two-term series expansion formula is shown in FIG. 15. The 5 
fit of the two-term approximation is significantly better than 
that of the Gaussian approximation. The two-term formula 
provides an excellent fit of the residual signal PDF, espe- 
cially near the peak in the distribution. The greatest source 
of error in the approximation occurs in the transition regions to 
of the distribution, between 1 and 4 standard deviations from 
the mean. The rms error for the two-term series expansion 
PDF is 0.122, which is nearly a factor of four smaller than 
the rms error of the Gaussian PDF. The PDF generated by 
the three-term series expansion formula is shown in FIG. 16. 15 
The three-term formula provides a better fit of the peak in 
the distribution than does the two -term formula. The rms 
error for the three-term approximation is 0.151, which is 
slightly larger than that of the two -term approximation 
because its fit of the lower transition region is less accurate 20 
than that of the two-term approximation. 

The rms errors of the four calculations for each of the 
sensors in the model are listed in FIG. 17. Also included in 
FIG. 17 are the rms errors for Gaussian PDFs of the same 
mean, standard deviation, and area as the residual signal 25 
PDFs. As indicated by the data in the first two columns of 
FIG. 17, the one-term series expansion formula produces 
PDFs that are slightly more accurate than the corresponding 
Gaussian PDFs. Significant improvement is exhibited by the 
two-term series expansion formula, which produces PDFs 30 
whose rms errors are as much as a factor of four smaller than 
those from the corresponding Gaussian and one-term series 
expansion approximations. The three-term series expansion 
formula generally produces larger rms errors than does the 
two-term formula because the fit of the transitions regions of 35 
the distribution is less accurate. This trend is further exhib- 
ited by the four-term series expansion formula. For three of 
the calculations (i.e., sensor numbers 1 , 2 , and 6 ), the 
four-term approximations are unstable in the transition 
regions, resulting in rms errors that are much larger than 40 
those of the corresponding Gaussian approximations. The 
four-term series expansion formula produces an improved 
approximation for the residual signal from sensor number 5 
only. The residual signal for this accelerometer is the most 
nearly Gaussian of the sensors to begin with, as indicated by 45 
the rms errors for the Gaussian approximations for all six 
sensors. These results suggest that the higher order terms in 
the series expansion should be used with caution: only those 
PDFs that are nearly Gaussian to begin with should be 
approximated with terms higher than the second term in the 50 
series expansion. 

In the ASP method for near-Gaussian distributions, the 
central moments of the distribution to be approximated are 
used to evaluate the coefficients in the series expansion. A 
second approach, known as the optimized ASP method for 55 
near-Gaussian distributions, was also evaluated. In the sec- 
ond approach, the higher-order central moments (i.e., jll 3 , j h 4 , 
jll 5 , and ja 6 ) are treated as free parameters in the model and 
the approximation is optimized by searching for values of 
the central moments that provide the best fit of the distri- 60 
bution. In this approach, the simplex optimization method is 
used to minimize the rms error of the calculation, as 
described in A Simplex Method for Function Minimization, 
by J. A. Nelder and R. Mead, Computer Journal, Vol. 7, 
1965, at pages 308 through 313. The simplex method is 65 
particularly useful for minimizing complicated transcenden- 
tal functions of two or more variables because it minimizes 
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a function by utilizing evaluations of the function only — no 
evaluations of function derivatives are required. 

In the optimized ASP method for near-Gaussian distribu- 
tions, the first step is to select the approximation formula to 
be optimized, either the one, two, three, or four-term series 
expansion formula. The number of free parameters in the 
model is determined by the approximation formula used. For 
instance, the one-teim formula requires only one free param- 
eter (i.e., fi 3 ), whereas the four-term formula requires four 
free parameters (i.e., p, 3 , \ i 4 , p 5 , and p 6 ). The function that is 
minimized by the simplex algorithm is the rms error for the 
approximation, given by equation E68. The higher-order 
central moments of the distribution are used as initial values 
of the free parameters in the model. The simplex algorithm 
iterates on the free parameters until a minimum in the 
function is found, thereby producing the best fit (i.e., small- 
est rms error) between the calculated PDF and the residual 
signal PDF, for a given approximation formula. 

The rms errors of the calculations using the optimized 
ASP method for near-Gaussian distributions are listed in 
FIG. 18. Also included in FIG. 18 are the rms errors for 
Gaussian PDFs of the same mean, standard deviation, and 
area as the residual signal PDFs. Comparing the rms errors 
of the Gaussian approximations to those from the optimized 
calculations shows that all optimized series expansion for- 
mulas produce significant reductions in the rms error. Com- 
paring the data in FIG. 18 to those of FIG. 17 shows that the 
optimized ASP method produces more accurate approxima- 
tions than does the ASP method for each series expansion 
formula. For a given sensor, the optimized two-term series 
expansion formula produces rms errors that are roughly a 
factor of two smaller than those of the optimized one- term 
formula. The optimized three-term and four-term series 
expansion formulas though, produce rms errors that are 
roughly equivalent to those of the optimized two -term 
formula. Since the three-term and four-term models incor- 
porate more complicated functions with more free param- 
eters than the two-teim model, these results indicate that the 
optimized two-teim series expansion method provides the 
best balance between accuracy and efficiency. 

The PDF generated by the optimized two -term series 
expansion formula for the first sensor is shown in FIG. 19. 
The optimized two -term formula provides an excellent fit of 
the residual signal PDF across all regions of the distribution, 
including the transition regions. Comparing the results of the 
optimized ASP calculation in FIG. 19 to those from corre- 
sponding ASP calculations in FIG. 15 and FIG. 16, reveals 
that the main effect of the optimization is to improve the 
accuracy of the approximation through the transition regions 
of the distribution. 

The optimized ASP method for near-Gaussian distribu- 
tions produces an excellent fit of the residual signal distri- 
butions from the MSET model of SSME accelerometer 
signals. Because the optimized ASP method is tuned to the 
specific data distributions analyzed, the empirical error 
probabilities for the method will better adhere to the theo- 
retical limits for the fault detection procedure. 

A parametric study of the empirical false alarm rate was 
performed to compare the false alarm of the SPRT method 
to the new ASP method for the SSME residual signals. In the 
study, the residual signals generated by applying the six 
sensor MSET model to the accelerometer signals from flight 
STS057 engine 1 were analyzed with the SPRT and ASP 
sequential hypothesis tests at various values of the false 
alarm probability and the system disturbance magnitude 
parameters. Because of the high frequency of the acceler- 
ometer signals (10.24 kHz) and long duration of the flight 
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(542.4 sec), the residual signals contain over five million 
data values. The residual signal data set was so large that 
every tenth point was extracted from the signals and the fault 
detection models were applied to the subset. The subset of 
the residual signals was large enough to produce accurate 
statistics but small enough to permit multiple calculations in 
a reasonable time span. Typically, the sequential hypothesis 
tests make a decision for every 1 to 10 data points analyzed. 
Thus for the parametric calculations of the SSME residual 
signals, hundreds of thousands of decisions were made for 
each signal. By dividing the number of fault decisions from 
a SPRT analysis of a residual signal into the total number of 
fault and normal decisions made by the test, the empirical 
false alarm rate could be accurately evaluated. 

Referring to FIG. 20, the results from the application of 
the positive mean SPRT and ASP tests to the residual signal 
from the first sensor are shown. The calculations shown in 
FIG. 20 were performed with a constant preassigned missed 
alarm probability (p) of 0.01, a constant system disturbance 
magnitude (M) of 6, and a variable preassigned false alarm 
probability (a). The solid diagonal line in FIG. 20 represents 
the theoretical upper bound for the empirical false alarm 
probability, as defined by equation E39. For the SPRT 
calculation, the empirical false alarm rate satisfies the theo- 
retical upper bound only for preassigned false alarm prob- 
abilities greater than 0.003. For smaller values of the pre- 
assigned false alarm probability, the SPRT model reaches 
more fault decisions than one would expect based on theo- 
retical arguments. The theoretical upper bound for the 
empirical false alarm probability is not met for all values of 
the preassigned false alarm probability because the residual 
signals are not purely Gaussian, as confirmed in FIG. 13. 
The residual signal distribution has more data populating the 
tail regions than does a Gaussian distribution of the same 
mean, standard deviation, and area as the residual signal 
distribution. The data from the tail regions of a distribution 
are a major source of false alarms in the SPRT calculations, 
because they are more representative of the alternative 
hypothesis than they are of the normal hypothesis. Because 
the residual signals exhibit more heavily populated tail 
regions than true Gaussian distributions, they have a ten- 
dency to trigger more false alarms than anticipated, espe- 
cially at high levels of sensitivity (i.e., small values of the 
false alarm probability). Because the optimized ASP method 
is tuned to the specific data distributions exhibited by the 
residual signals, the empirical false alarm rate for the ASP 
calculation satisfies the theoretical upper bound for all 
values of the preassigned false alarm probability. The ASP 
method produces fewer false alarms than the SPRT method 
for all values of the preassigned false alarm probability. 

ASP Benefits and Applications: 

The Adaptive Sequential Probability (ASP) technique was 
developed as the fault detection element of a software 
program that reliably detects signal data faults for an asset, 
such as a process and/or apparatus. These signal validation 
modules improve safety, reduce maintenance cost, and 
enable optimal performance for a wide range of aeronauti- 
cal, industrial, chemical, power generating, medical, bio- 
logical, financial, and military assets. Signal validation is 
required in all types of process critical control and safety 
systems where unexpected process interruptions due to 
sensor or control component failures or false alarms are 
unsafe or uneconomical. Signal validation assures the safe, 
reliable operation of a process or apparatus and reduces the 
manpower, schedule and uncertainty associated with sensor 
and component failure detection. Signal validation prevents 
a facility safety or control system from making critical 
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decisions, such as the decision to shut down a process or 
abort a mission, on the basis of bad sensor data. Signal 
validation improves process quality and efficiency by ensur- 
ing that closed loop control or data analysis is performed 
5 using good data. Finally, signal validation increases system 
up time and decreases system maintenance cost by enabling 
sensor calibration using on-condition criteria rather than 
time-in- service criteria. 

Bayesian Belief Network (BBN) Example 
to In one example of the fault cause procedure use and 
operation, a test setup was configured for exposing two 
redundant hydrogen sensor assemblies to varying concen- 
trations of H 2 gas. Each of these sensor assemblies contains 
two H 2 sensor elements, one temperature sensor element, 
15 and one heater control element. A test system computer used 
the data acquisition means to monitor the signal sources 
consisting of the three sensor elements on each of the two 
sensors and actuated an alarm if hydrogen levels exceeded 
safety limits. As each observation was analyzed, the fault 
20 detection returned an array of ones and zeros indicating 
whether each signal’s reading was normal or abnormal. The 
Bayesian belief network (BBN) used the fault detection 
model’s output as positive findings to determine the state for 
each of its alarm indication leaf nodes. The BBN used this 
25 state information to determine the probability of any of the 
specified possible causes of the fault, thereby classifying the 
fault. 

The BBN applies the output from each fault detector 
associated with one of its alarm indication leaf nodes as a 
30 positive finding for the leaf node. The BBN combines this 
information from all of its leaf nodes to determine the 
probability of any of the specified possible causes of the 
fault. The BBN fault cause model or submodel configuration 
and probability tables can be different in each operating 
35 mode. However, in this example a similar BBN fault clas- 
sification model configuration was used in each of the two 
system operating modes. The configuration was as follows. 
Six fault nodes were created to determine the specific sensor 
element that had failed. Each sensor element node had two 
40 possible states, good (not faulty) and bad (faulty). Two 
additional fault nodes were created to determine whether the 
entire sensor had failed. These were named Sensorl and 
Sensor2. They also took on the states good and bad. If a node 
is in a good state, the implication is that the associated asset 
45 item is operating acceptably. For each of the fault nodes, we 
assign a prior probability. This is the probability that the 
proposition is true. For example, we believe that Sensorl 
and Sensor2 are reliable under the test conditions, so we 
believe they will behave correctly 95% of the time. This 
50 implies that they will behave incorrectly 5% of the time. 
This may be due to any number of reasons, e.g., not 
supplying power to the sensor, damage to the sensor’s 
cables, etc. The reasons for this failure are not of interest, 
only that each sensor can be expected to operate correctly 
55 95% of the time. We believe the probability of a bad element 
to be greater that the probability of a bad sensor, so we 
assigned a 10% prior probability to each of the elements 
failing. Conversely, there is 90% prior probability that each 
element has not failed. 

60 Each of the alarm indication nodes is a leaf node. Each 
leaf node is a child of at least one parent, and a conditional 
probability is required for each of the combinations of parent 
states. That is, given the state of each of the parents, there is 
a probability that the alarm leaf node will be in either the 
65 ‘‘normal” or “abnormal” state. Each fault node indicating a 
bad element has an associated alarm leaf node as a child. 
Each “bad sensor” node has three of the leaf nodes as 
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children. Therefore, each leaf node has two parents. The 
conditional probability tables require one entry for each state 
of the node for each combination of the node’s parent’s 
states. This yields a conditional probability table for each 
node that requires eight separate entries. 5 

Conditional probabilities for the leaf nodes were divided 
into three conditions. If all of the parent states were “good”, 
the probability that each of the child nodes were in a normal 
state was very high (95%). If either the associated sensor or 
the element were “bad”, the probability that the sensor was to 
behaving abnormally was also high (95%). If both the sensor 
and the element were “bad” the probability that the sensor 
was behaving abnormally was higher than if only one or the 
other were “bad”, so a 99% probability of failure was 
assigned. 15 

The system was exposed to H 2 at 0%, 0.1%, 0.5%, 1.0%, 
5.0%, 10.0% and 100.0% concentrations during normal 
operation. Training data was collected during these expo- 
sures from each of these elements at one-second intervals. 
Test data was taken several months later at 3.57% H 2 and 0% 20 
H 2 using the same sensors and test configuration. Next, these 
data were adjusted to create additional sets of training and 
test data wherein the effect of hydrogen tank venting in the 
vicinity of the sensors was simulated. Tank venting has the 
effect of increasing the background hydrogen concentration 25 
in the vicinity of the sensors and can cause the hydrogen 
detection system to produce undesirable false alarms or 
missed alarms. 

An operating mode determination procedure 31 was used 
to classify each observation on the basis of the tank vent 30 
valve state. The procedure classified the operating mode as 
OPERATING whenever the tank valve indication was less 
than 50% open and the operating mode as VENTING 
whenever the tank valve indication was more than 50% 
open. 35 

Two parameter estimation submodels 52 were trained on 
the training data, one for OPERATING and one for VENT- 
ING. Two fault detection submodels 54 were calibrated for 
each included sensor on the basis of their corresponding 
submodel estimates taken over the training data for their 40 
respective operating modes. Each fault detector returned an 
array of ones and zeros indicating whether each sensor 
element’s reading was normal or abnormal. Test results 
demonstrated that the BBN is effective for diagnosing faults 
detected by the parameter estimation and fault detection 45 
procedures. Nominal (OPERATING) test data consisted of 
data obtained during exposure to 3.57% H 2 , and during 
exposure to 0% H 2 using the sensors that had been previ- 
ously used to acquire the training data. As expected, the 
system accurately predicted the data and generated no 50 
alarms. 

While a BBN fault cause or classification procedure 84 is 
described by example, a variety of types of fault cause 
classification procedure can be used with the instant inven- 
tion. 55 

Accordingly, in one aspect, the present invention provides 
a new, novel and useful surveillance system: apparatus and 
method. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 60 
unconstrained form of a residual error probability density 
function model used in said surveillance system’s fault 
detection method. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method which can per- 65 
form high sensitivity surveillance for a wide variety of assets 
including industrial, utility, business, medical, transporta- 
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tion, financial, and biological processes and apparatuses 
wherein such process and/or apparatus asset preferably has 
at least one pair of redundant actual and/or virtual signals. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of a 
statistical hypothesis test surveillance decision procedure 
that uses a fault detection model comprised of a probability 
density function model of a residual error signal that is of an 
unconstrained form. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved fault detection model for a process surveillance 
scheme using recorded operating data for an asset to train a 
fault detection model. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for surveillance of on-line, 
real-time signals, or off-line accumulated signal data. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for surveillance of signal 
sources and detecting a fault or error state of the signal 
sources enabling responsive action thereto. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for surveillance of signal 
sources and detecting a fault or error state of the asset 
processes and apparatuses enabling responsive action 
thereto. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved decision method as to the accuracy or validity for 
at least one process signal parameter given an observation of 
at least one actual signal from the asset. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
eter is a multivariate state estimation technique (MSET) 
having any one of a plurality of pattern recognition matrix 
operators, training procedures, and operating procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
eter is a kernel regression technique having any one of a 
plurality of structures, training procedures, and operating 
procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
eter is a neural network having any one of a plurality of 
structures, training procedures, and operating procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
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eter is a mathematical process model having any one of a 
plurality of structures, training procedures, and operating 
procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
eter is an autoregressive moving average (ARMA) model 
having any one of a plurality of structures, training proce- 
dures, and operating procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of an 
improved system and method for ultra- sensitive detection of 
a fault or error state of signal sources and/or asset processes 
and apparatuses wherein the parameter estimation technique 
used for the generation of at least one virtual signal param- 
eter is a Kalman filter model having any one of a plurality 
of structures, training procedures, and operating procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of a 
novel system and method for using at least one of a plurality 
of methods to classify the state of a residual error signal 
produced by the mathematical difference between two sig- 
nals, said two signals being either actual and/or predicted 
signals, into one of at least two categories. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of a 
novel system and method to classify the state of a residual 
error signal wherein said classification is made to distinguish 
between a normal signal and an abnormal signal. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of a 
novel system and method to classify the state of a residual 
error signal wherein said classification is performed using a 
statistical hypothesis test having any one of a plurality of 
probability density function models, training procedures, 
and operating procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method comprised of a 
novel system and method to classify the state of a residual 
error signal wherein said classification is performed using a 
probability density function model having any one of a 
plurality of structures, training procedures, and operating 
procedures. 

In another aspect, an embodiment of invention provides a 
surveillance system: apparatus and method having fault 
cause and operating mode partitioning. 

Moreover, it should be apparent that numerous structural 
modifications and adaptations may be resorted to without 
departing from the scope and fair meaning of the present 
invention as set forth hereinabove and as described herein 
below by the claims. 

I claim: 

1. A method for performing surveillance of an asset, the 
steps including: 

obtaining residual data correlative to asset operation; 

fitting a mathematical model to the obtained residual data 
and storing the mathematical model in a memory 
means; 

obtaining a current set of observed signal data from the 
asset; 

using the mathematical model in a sequential hypothesis 
test for determining if the current set of observed signal 
data is indicative of an asset fault condition; 
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determining an asset fault cause correlative to a deter- 
mined indication of an asset fault condition; and 

providing an indication correlative to a determined asset 
fault cause for providing asset surveillance. 

5 2. The method of claim 1 wherein the step of determining 

an asset fault cause includes a step of determining a most 
probable asset fault cause from a plurality of possible asset 
fault causes. 

3. The method of claim 2 wherein the step of determining 
to the most probable asset fault cause from the plurality of 

possible asset fault causes includes a step of utilizing a 
Bayesian belief network to determine the probability of at 
least one possible asset fault cause. 

4. The method of claim 1 further including a step of 
15 providing a response to a provided indication correlative to 

a determined asset fault cause. 

5. The method of claim 4 wherein the step of providing 
the response includes a step of providing a control action. 

6. The method of claim 4 wherein the step of providing 
20 the response includes a step of providing an alarm action. 

7. The method of claim 1 wherein the step of obtaining 
residual data correlative to asset operation includes a step of 
transforming the residual data to remove noise from the 
residual data. 

25 8. The method of claim 1 wherein the step of obtaining 

residual data correlative to asset operation includes a step of 
obtaining residual data correlative to at least two modes of 
asset operation. 

9. The method of claim 8 further including a step of 
30 dividing the obtained residual data into a plurality of 

residual data subsets each having at least one mode of asset 
operation associated thereto. 

10. The method of claim 9 further including a step of 
fitting at least one mathematical model to at least one of the 

35 plurality of residual data subsets for defining at least the one 
mathematical model stored in the memory means. 

11. A method for performing surveillance of an asset, the 
steps including: 

obtaining a current set of observed signal data from the 
40 asset; 

determining a mode of asset operation correlative to the 
current set of observed signal data obtained from the 
asset; 

retrieving at least one mathematical model from a 
45 memory means as a function of the determined mode of 
asset operation for obtaining at least one retrieved 
mathematical model, and 

using at least the one retrieved mathematical model in a 
sequential hypothesis test for determining if the current 
50 set of observed signal data is indicative of an asset fault 
condition for providing asset surveillance. 

12. The method of claim 1 further including a step of 
obtaining residual data correlative to at least two modes of 
asset operation. 

55 13. The method of claim 12 further including a step of 

dividing the obtained residual data into a plurality of 
residual data subsets each having at least one mode of asset 
operation associated thereto. 

14. The method of claim 13 further including a step of 
60 fitting at least one mathematical model to at least one of the 

plurality of residual data subsets for defining at least the one 
mathematical model stored in the memory means. 

15. The method of claim 14 wherein the fitting step 
includes a step of fitting at least one mathematical function 

65 correlative to a frequency distribution to at least one of the 
plurality of residual data subsets for defining at least the one 
mathematical model stored in the memory means. 
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16. The method of claim 11 further including a step of 
determining an asset fault cause correlative to a finding of an 
asset fault condition. 

17. The method of claim 16 wherein the step of deter- 
mining an asset fault cause includes a step of determining a 
most probable asset fault cause from a plurality of possible 
asset fault causes. 

18. The method of claim 17 further including a step of 
providing an indication correlative to a determined most 
probable asset fault cause. 

19. The method of claim 18 further including a step of 
providing a response to a provided indication correlative to 
a determined most probable asset fault cause. 

20. The method of claim 19 wherein the step of providing 
the response includes a step of providing a control action. 

21. The method of claim 19 herein the step of providing 
the response includes a step of providing an alarm action. 

22. A system for performing surveillance of an asset, said 
system comprising in combination: 

means for obtaining a current set of observed signal data 
from the asset; 

means for performing a sequential hypothesis test for 
determining if the current set of observed signal data is 
indicative of an asset fault condition; 

means for determining an asset fault cause correlative to 
a determined indication of an asset fault condition; and 

means for providing an indication correlative to a deter- 
mined asset fault cause for providing asset surveillance. 

23 . The combination of claim 22 further including means 
for obtaining residual data correlative to at least two modes 
of asset operation and means for dividing the acquired 
residual data into a plurality of residual data subsets each 
having at least one mode of asset operation associated 
thereto. 

24. The combination of claim 23 further including means 
for fitting at least one mathematical model to at least one of 
the plurality of residual data subsets and storing at least the 
one mathematical model in a memory means. 

25. The combination of claim 24 further including means 
for determining a mode of asset operation correlative to the 
current set of observed signal data obtained from the asset, 
means for retrieving at least one mathematical model from 
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said memory means as a function of the determined mode of 
asset operation for obtaining at least one retrieved math- 
ematical model, and means for using at least the one 
retrieved mathematical model in performing the sequential 

5 hypothesis test for determining if the current set of observed 
signal data is indicative of an asset fault condition. 

26. The combination of claim 22 wherein said determin- 
ing means includes means for determining a most probable 
asset fault cause correlative to a determined indication of an 

to asset fault condition from a plurality of possible asset fault 
causes. 

27. The combination of claim 26 further including means 
for providing an indication correlative to a determined most 
probable asset fault cause. 

15 28. The combination of claim 27 further including means 

for providing a response to an indication correlative to a 
determined most probable asset fault cause. 

29. The combination of claim 28 wherein said means for 
providing a response includes means for providing a control 

20 action to an indication correlative to a determined most 
probable asset fault cause. 

30. The combination of claim 28 wherein said means for 
providing a response includes means for providing an alarm 
action to an indication correlative to a determined most 

25 probable asset fault cause. 

31. A method for performing surveillance of an asset, the 
steps including: 

obtaining a current set of observed signal data from the 
asset; 

30 performing a sequential hypothesis test for determining if 
the current set of observed signal data is indicative of 
an asset fault condition; 

determining an asset fault cause correlative to a deter- 
mined indication of an asset fault condition; and 

35 providing an indication correlative to a determined asset 
fault cause for providing asset surveillance. 

32. A computer readable medium having a program 
recorded thereon in which the program causes, in use, a 
computer running the program to execute a procedure 

40 according to the method of claim 31. 



